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Preface 



Norway is a country rich on natural resources. Wind, rain and snow provide us with 
a huge resource for clean energy production, while oil and gas have contributed 
significantly, since the early 1970s, to the country’s economic wealth. Nowadays 
the income from oil and gas exploitation is invested in the world’s financial markets 
to ensure the welfare of future generations. With the rising global concerns about 
climate, using renewable resources for power generation has become more and 
more important. Bad management of these resources will be a waste that is a 
negligence to avoid given the right tools. 

This formed the background and motivation for the research group Stochastics 
for Environmental and Financial Economics (SEFE) at the Centre of Advanced 
Studies (CAS) in Oslo, Norway. During the academic year 2014-2015, SEFE 
hosted a number of distinguished professors from universities in Belgium, France, 
Germany, Italy, Spain, UK and Norway. The scientific purpose of the SEFE centre 
was to focus on the analysis and management of risk in the environmental and 
financial economics. New mathematical models for describing the uncertain 
dynamics in time and space of weather factors like wind and temperature were 
studied, along with sophisticated theories for risk management in energy, com- 
modity and more conventional financial markets. 

In September 2014 the research group organized a major international confer- 
ence on the topics of SEFE, with more than 60 participants and a programme 
running over five days. The present volume reflects some of the scientific devel- 
opments achieved by CAS fellows and invited speakers at this conference. All the 
14 chapters are stand-alone, peer-reviewed research papers. The volume is divided 
into two parts; the first part consists of papers devoted to fundamental aspects of 
stochastic analysis, whereas in the second part the focus is on particular applications 
to environmental and financial economics. 
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We thank CAS for its generous support and hospitality during the academic year 
we organized our SEFE research group. We enjoyed the excellent infrastructure 
CAS offered for doing research. 

Oslo, Norway Ered Espen Benth 

June 2015 Giulia Di Nunno 
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Part I 
Foundations 




Some Recent Developments in Ambit 
Stochastics 



Ole E. Barndorff-Nielsen, Emil Hedevang, Jurgen Schmiegel 
and Benedykt Szozda 



Abstract Some of the recent developments in the rapidly expanding field of Ambit 
Stochastics are here reviewed. After a brief recall of the framework of Ambit Sto- 
chastics, two topics are considered: (i) Methods of modelling and inference for 
volatility/intermittency processes and fields; (ii) Universal laws in turbulence and 
finance in relation to temporal processes. This review complements two other recent 
expositions. 

Keywords Ambit stochastics • Stochastic volatility/intermittency • Universality • 
Finance • Turbulence • Extended subordination • Metatimes • Time-change 



1 Introduction 

Ambit Stochastics is a general framework for the modelling and study of dynamic 
processes in space-time. The present paper outlines some of the recent developments 
in the area, with particular reference to finance and the statistical theory of turbu- 
lence. Two recent papers [8, 36] provide surveys that focus on other sides of Ambit 
Stochastics. 
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A key characteristic of the Ambit Stochastics framework, which distinguishes this 
from other approaches, is that beyond the most basic kind of random input it also 
specifically incorporates additional, often drastically changing, inputs referred to as 
volatility or intermittency. 

Such “additional” random fluctuations generally vary, in time and/or in space, in 
regard to intensity (activity rate and duration) and amplitude. Typically the volatil- 
ity/intermittency may be further classified into continuous and discrete (i.e. jumps) 
elements, and long and short term effects. In turbulence the key concept of energy 
dissipation is subsumed under that of volatility/intermittency. 

The concept of (stochastic) volatility/intermittency is of major importance in many 
fields of science. Thus volatility/intermittency has a central role in mathematical 
finance and financial econometrics, in turbulence, in rain and cloud studies and 
other aspects of environmental science, in relation to nanoscale emitters, magne- 
tohydrodynamics, and to liquid mixtures of chemicals, and last but not least in the 
physics of fusion plasmas. 

As described here, volatility/intermittency is a relative concept, and its mean- 
ing depends on the particular setting under investigation. Once that meaning is 
clarified the question is how to assess the volatility/intermittency empirically and 
then to describe it in stochastic terms, for incorporation in a suitable probabilistic 
model. Important issues concern the modelling of propagating stochastic volatil- 
ity/intermittency fields and the question of predictability of volatility/intermittency. 

Section 2 briefly recalls some main aspects of Ambit Stochastics that are of rel- 
evance for the dicussions in the subsequent sections, and Sect. 3 illustrates some of 
the concepts involved by two examples. The modelling of volatility/intermittency 
and energy dissipation is a main theme in Ambit Stochastics and several approaches 
to this are discussed in Sect. 4. A leading principle in the development of Ambit 
Stochastics has been to take the cue from recognised stylised features — or universal- 
ity traits — in various scientific areas, particularly turbulence, as the basis for model 
building; and in turn to seek new such traits using the models as tools. We discuss 
certain universal features observed in finance and turbulence and indicate ways to 
reproduce them in Sect. 5. Section 6 concludes and provides an outlook. 



2 Ambit Stochastics 
2.1 General Framework 

In terms of mathematical formulae, in its original form [17] (cf. also [16]) an ambit 
field is specified by 




g{x, I, t, ^)cr(f , s) L{d^ ds) + Q{x, t) (1) 



Some Recent Developments in Ambit Stochastics 



5 



time 




Fig.l A Spatio-temporal ambit field. The value Y(x,t) of the field at the point marked by the black 
dot is defined through an integral over the corresponding ambit set A(x,t) marked by the shaded 
region. The circles of varying sizes indicate the stochastic volatility/intermittency. By considering 
the field along the dotted path in space-time an ambit process is obtained 



where 

Q{x,t)=[ q(x,^,t,s)x(^,s)d^ ds. (2) 

J D{x,t) 

Here t denotes time while x gives the position in J-dimensional Euclidean space. 
Further, A{x,t) and D{x,t) are subsets of x M and are termed ambit sets, g and q 
are deterministic weight functions, and L denotes a Levy basis (i.e. an independently 
scattered and infinitely divisible random measure). Further, a and x are stochastic 
fields representing aspects of the volatility/intermittency. In Ambit Stochastics the 
models of the volatility/intermittency fields a and x are usually themselves specified 
as ambit fields. We shall refer to a as the amplitude volatility component. Figure 1 
shows a sketch of the concepts. 

The development of Y along a curve in space-time is termed an ambit process. 
As will be exemplified below, ambit processes are not in general semimartingales, 
even in the purely temporal case, i.e. where there is no spatial component x. 
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In a recent extension the structure (1) is generalised to 



F(x,0 = /^+ / g{x,^,t,s)G{^,s)LT{d^ds)^~ Q{x,t) (3) 



where Q is like (2) or the exponential thereof, and where T is a metatime expressing 
a further volatility/intermittency trait. The relatively new concept of metatime is 
instrumental in generalising subordination of stochastic processes by time change 
(as discussed for instance in [22]) to subordination of random measures by random 
measures. We return to this concept and its applications in the next section and refer 
also to the discussion given in [8]. 

Note however that in addition to modelling volatility/intermittency through the 
components a, x ^nd T, in some cases this may be supplemented by probability 
mixing or Levy mixing as discussed in [12]. 

It might be thought that ambit sets have no role in purely temporal modelling. 
However, examples of their use in such contexts will be discussed in Sect. 3. 

In many cases it is possible to choose specifications of the volatility/intermittency 
elements a , x and T such that these are infinitely divisible or even selfdecomposable, 
making the models especially tractable analytically. We recall that the importance 
of the concept of selfdecomposability rests primarily on the possibility to represent 
selfdecomposable variates as stochastic integrals with respect to Levy processes, 
see [32]. 

So far, the main applications of ambit stochastics has been to turbulence and, to 
a lesser degree, to financial econometrics and to bioimaging. An important potential 
area of applications is to particle transport in fluids. 



2.2 Existence of Ambit Fields 

The paper [25] develops a general theory for integrals 



where is a predictable stochastic function and M is a dispersive signed random 
measure. Central to this is that the authors establish a notion of characteristic triplet 
of M, extending that known in the purely temporal case. A major problem solved in 
that regard has been to merge the time and space aspects in a general and tractable 
fashion. Armed with that notion they determine the conditions for existence of the 
integral, analogous to those in [37] but considerably more complicated to derive and 
apply. An important property here is that now predictable integrands are allowed 
(in the purely temporal case this was done in [23]). Applications of the theory to 
Ambit Stochastics generally, and in particular to superposition of stochastic volatility 
models, is discussed. 



A{x,t) 




h(x, y, L s) M{dx dx) 
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Below we briefly discuss how the metatime change is incorporated in the frame- 
work of [25]. Suppose that L = {L{A) \ A g is a real- valued, homo- 
geneous Levy basis with associated inflnitely divisible law /x g that is 

L([0, 1]'^+^) is equal in law to /x. Let (y, U, v) be the characteristic triplet of /x. 
Thus y G R, 27 > 0 and v is a Levy measure on R. 

Suppose that T = {T(A) | A g is a random meta-time associated 

with a homogeneous, real-valued, non-negative Levy basis T = [T (A) | A g 
^(R^/+i)} jYiSit is the sets T(A) and T(5) are disjoint whenever A, B e 
are disjoint, T(U^qA„) = U^qT(A„) whenever A„, U^qA„ g and 

T(A) = Lebj+i(T(A)) for all A g Here and in what follows, Leby^ 

denotes the Lebesgue measure on R^. For the details on construction of random 
meta- times cf. [1 1]. Suppose also that k g 7D(R) is the law associated to T and that 
k ~ 7D()0, 0, p). Thus > 0 and p is a Levy measure such that p(R_) = 0 and 
Jj^(l A x) p{dx) < 00 . 

Now, by [11, Theorem 5.1] we have that Lt = {T(T(A)) | A g ^(R^+^)} 
is a homogeneous Levy basis associated to /x^ with /x^ ~ ID(y^, v^) and 
characteristics given by 



y=^y + 

= /3E 



n 

Jo J\x\<l 



x/ji^ (dx)p(ds) 



L 



v^{B) = pv(B) + I iJ,^{B)p{ds), B e \ {0}) 



where pi^ is given by pi^ = for any 5’ > 0. 

Finally, suppose that a(x,t) is predictable and that Lt has no fixed times of 
discontinuity (see [25]). By rewriting the stochastic integral in the right-hand side of 
(3) as 

X{xj)= / H{x,^J,s)LT{d^ds), 

Jm^+1 



with 77 (v, t, s) = 1a(x,o(§’ L s)cr(^, s) we can use [25, Theorem 4.1]. 

Observe that the assumption that a is predictable is enough as both A(x,t) and 
g(x, t, s) Sive deterministic. This gives us that X is well defined for all (x, t) if the 
following hold almost surely for all (x,t) G R'^+^i 



I 

jRd+l 



7 . 






I H^{x, t, s)S^ d^ds < oo 

iR‘' ' ■ 



d^ds < 00 

( 4 ) 

( 5 ) 



I [ 

Jw+i m 



(1 A (H(x, f , f, s)y)'^ v*(dy)d^ds < oo. 



( 6 ) 
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3 Illustrative Examples 

We can briefly indicate the character of some of the points on Ambit Stochastics 
made above by considering the following simple model classes. 



3.1 BSS and LSS Processes 

Stationary processes of the form 

i"(0= f g(t - s)a{s)BT{ds) + f q{t — s)a{s)^ ds. (7) 

J —OQ J —OO 

are termed Brownian semistationary processes — or BSS for short. Here the setting 
is purely temporal and Bt is the time change of Brownian motion 5 by a chronome- 
ter T (that is, an increasing, cMlag and stochastically continuous process ranging 
from — OO to oo), and the volatility/intermittency process a is assumed stationary. 
The components cr and T represent respectively the amplitude and the intensity of 
the volatility/intermittency. If T has stationary increments then the process Y is sta- 
tionary. The process (7) can be seen as a stationary analogue of the BNS model 
introduced by Bamdorff-Nielsen and Shephard [14]. 

Note that in case T increases by jumps only, the infinitesimal of the process Bj 
cannot be reexpressed in the form x {s)B{ds), as would be the case if T was of type 
Tt = /q if{u) du with X = V^- 

Further, for the exemplification we take g to be of the gamma type 

gW = (8) 

Subject to a weak (analogous to (4)) condition on a, the stochastic integral in (7) 
will exist if and only if v > 1/2 and then Y constitutes a stationary process in time. 
Moreover, 7 is a semimartingale if and only if v does not lie in one of the intervals 
( 1 /2 , 1 ) and (1,3/2]. Note also that the sample path behaviour is drastically different 
between the two intervals, since, as r ^ 0, g(0 tends to oo when v g (1/2, IjandtoO 
when V e (1,3/2]. Further, the sample paths are purely discontinuous ifv g (1/2, 1) 
but purely continuous (of Holder index H = v — 1/2) when v G (1,3/2). 

The cases where v g (1/2, 1) have a particular bearing in the context of turbulence, 
the value v = 5/6 having a special role in relation to the Kolmogorov-Obukhov 
theory of statistical turbulence, cf. [3, 33]. 

The class of processes obtained by substituting the Brownian motion in (7) by a 
Levy process is referred to as the class of Levy semistationary processes — or LSS 
processes for short. Such processes are discussed in [8, 24, Sect. 3.7] and references 
therein. 
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3.2 Trawl Processes 

The simplest non-trivial kind of ambit field is perhaps the trawl process, introduced 
in [2]. In a trawl process, the kernel function and the volatility field are constant 
and equal to 1, and so the process is given entirely by the ambit set and the Levy 
basis. Specifically, suppose that L is a homogeneous Levy basis on x M and that 
A c X M is a Borel subset with finite Lebesgue measure, then we obtain a trawl 
process Y by letting A(t) = A (0, t) and 



The process is by construction stationary. Depending on the purpose of the modelling, 
the time component of the ambit set A may or may not be supported on the negative 
real axis. When the time component of A is supported on the negative real axis, 
we obtain a causal model. Despite their apparent simplicity, trawl processes possess 



function (i.e. the distinguished logarithm of the characteristic function) of Y is given 
by 



Here and later, | A | denotes the Lebesgue measure of the set A. For the mean, variance, 
autocovariance and autocorrelation it follows that 



From this we conclude the following. The one-dimensional marginal distribution 
is determined entirely in terms of the size (not shape) of the ambit set and the 
distribution of the Levy seed; given any infinitely divisible distribution there exists 
trawl processes having this distribution as the one-dimensional marginal; and the 
autocorrelation is determined entirely by the size of the overlap of the ambit sets, 
that is, by the shape of the ambit set A. Thus we can specify the autocorrelation 
and marginal distribution independently of each other. It is, for example, easy to 
construct a trawl process with the same autocorrelation as the OU process, see [2, 
8] for more results and details. By using integer-valued Levy bases, integer-valued 



Y(t) = 




L{d^ ds) = / 1a(§, t - s) L{d^ ds) = L{A{t)) (9) 



enough flexibility to be of use. If L' denotes the seed^ of L, then the cumulant 



CRtT(0} = IAICRJL'}. 



( 10 ) 



E[Y(t)] = \A\E[L'], 
var(F(0) = |A|var(L^, 



r(t) := cov(F(0, T(0)) = |A H A(0I var(L'), 
cov(F(0, T(0)) |AnA(OI 
^ var(T(0)) |A| 



( 11 ) 



^The Levy seed L'(x) at x of a Levy basis L with control measure v is a random variable with the 
property that C{^fL (A)} = C{^fL'(x)} v(Jx). For a homogeneous Levy basis, the distribution 

of the seed is independent of x. 
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trawl processes are obtained. These processes are studied in detail in [5] and applied 
to high frequency stock market data. 

We remark, that Y{x,t) = L(A + (x, t)) is an immediate generalisation of trawl 
processes to trawl fields. It has the same simple properties as the trawl process. 

Trawl processes can be used to directly model an object of interest, for example, 
the exponential of the trawl process has been used to model the energy dissipation, 
see the next section, or they can be used as a component in a composite model, for 
example to model the volatility /intermittency in a Brownian semistationary process. 



4 Modelling of Volatility/Intermittency/Energy 
Dissipation 

A very general approach to specifying volatility/intermittency fields for inclusion in 
an ambit field, as in (1), is to take r = as being given by a Levy-driven Volterra 
field, either directly as 



with / positive and L a Levy basis (different from L in (1), or in exponentiated form 



When the goal is to have stationary volatility/intermittency fields, such as in mod- 
elling homogeneous turbulence, that can be achieved by choosing L to be homo- 
geneous and / of translation type. However, the potential in the specifications (12) 
and (13) is much wider, giving ample scope for modelling inhomogeneous fields, 
which are by far the most common, particularly in turbulence studies. Inhomogeneity 
can be expressed both by not having / of translation type and by taking the Levy 
basis L inhomogeneous. 

In the following we discuss two aspects of the volatility/intermittency mod- 
elling issue. Trawl processes have proved to be a useful tool for the modelling of 
volatility/intermittency and in particular for the modelling of the energy dissipation, 
as outlined in Sect. 4.1. Section 4.2 reports on a recent paper on relative volatil- 
ity/intermittency. In Sect. 4.3 we discuss the applicability of selfdecomposability to 
the construction of volatility/intermittency fields. 




( 12 ) 



z(x,t) = exp I / 




(13) 
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4.1 The Energy Dissipation 



In [31] it has been shown that exponentials of trawl processes are able to repro- 
duce the main stylized features of the (surrogate) energy dissipation observed for a 
wide range of datasets. Those stylized features include the one-dimensional marginal 
distributions and the scaling and self-scaling of the correlators. 

The correlator of order (p, is defined by 

E[8(t)Ps(t + sr] 

E[e(r)P]E[e(r +«)'?]■ 

The correlator is a natural analogue to the autocorrelation when one considers a 
purely positive process. In turbulence it is known (see the reference cited in [31]) 
that the correlator of the surrogate energy dissipation displays a scaling behaviour 
for a certain range of lags, 

oc s ^small ^ ^ ^ ^large’ 

where r (p, ^) is the scaling exponent. The exponent r (1 , 1) is the so-called intermit- 
tency exponent. Typical values are in the range 0. 1 to 0.2. The intermittency exponent 
quantifies the deviation from Kolmogorov’s 1941 theory and emphasizes the role of 
intermittency (i.e. volatility) in turbulence. In some cases, however, the scaling range 
of the correlators can be quite small and therefore it can be difficult to determine the 
value of the scaling exponents, especially when p and q are large. Therefore one also 
considers the correlator of one order as a function of a correlator of another order. In 
this case, self-scaling is observed, i.e., the one correlator is proportional to a power 
of the other correlator, 

Cp,g(s) (X Cp>^q'{sy^P’^’P'’^'\ (16) 

where z(p,q; p' , q') is the self-scaling exponent. The self-scaling exponents have 
turned out to be much easier to determine from data than the scaling exponents, 
and like the scaling exponents, the self- scaling exponents have proved to be key 
fingerprints of turbulence. They are essentially universal in that they vary very little 
from one dataset to another, covering a large range of the so-called Reynolds numbers, 
a dimensionless quantity describing the character of the flow. 

In [31] the surrogate energy dissipation s is, more specifically, modelled as 

£(0=exp(L(A(0)), (17) 

where L is a homogeneous Levy basis on M x M and A(t) = A-\- (0, t) for a bounded 
set A C M X M. The ambit set A is given as 



A = {(x, t)eRxR\0<t< Tj^rge. “/(O < -^ < f(t)}, 



(18) 
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X 



Fig. 2 The shaded region marks the ambit set A from (18) defined by (19) where the parameters 
are chosen to be Ti^^gQ = 1, ^small = 0-1 0=5 



where Tjarge ^ ^large ^ ^small ^ ^ ^ > 0, the function / is defined as 



f(t) = 



/ 1 - (r/riarge)" 
V + (^/^small)7 



0 < ^ < Tl^YgQ' 



(19) 



The shape of the ambit set is chosen so that the scaling behaviour (15) of the corre- 
lators is reproduced. The exact values of the scaling exponents are determined from 
the distribution of the Levy seed of the Levy basis. The two parameters 
7|arge determine the size of the small and large scales of turbulence: in between we 
have the inertial range. The final parameter 0 is a tuning parameter which accounts 
for the lack of perfect scaling and essentially just allows for a better fit. (Perfect 
scaling is obtained in the limit 0 oo). See Fig. 2 for an example. Furthermore, 
self-scaling exponents are predicted from the shape, not location and scale, of the 
one-point distribution of the energy dissipation alone. 

To determine a proper distribution of the Levy seed of L, it is in [31] shown 
that the one-dimensional marginal of the logarithm of the energy dissipation is well 
described by a normal inverse Gaussian distribution, i.e. log s{t) ~ NIG(ckf, fi, 8), 
where the shape parameters a and P are the same for all datasets (independent of 
the Reynolds number). Thus the shape of the distribution of the energy dissipation 
is a newly discovered universal feature of turbulence. Thus we see that L should be 
a normal inverse Gaussian Levy basis whose parameters are given by the observed 
distribution of log s{t). This completely specifies the parameters of (17). 
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4.2 Realised Relative Volatility /I ntermittency /Energy 
Dissipation 

By its very nature, volatility/intermittency is a relative concept, delineating varia- 
tion that is relative to a conceived, simpler model. But also in a model for volatil- 
ity/intermittency in itself it is relevant to have the relative character in mind, as 
will be further discussed below. We refer to this latter aspect as relative volatil- 
ity/intermittency and will consider assessment of that by realised relative volatil- 
ity/intermittency which is defined in terms of quadratic variation. The ultimate 
purpose of the concept of relative volatility/intermittency is to assess the volatil- 
ity/intermittency or energy dissipation in arbitrary subregions of a region C of space- 
time relative to the total volatility/intermittency/energy dissipation in C. In the purely 
temporal setting the realised relative volatility/intermittency is defined by 

VYs^t/VYs^T ( 20 ) 

where [7^]^ denotes the realised quadratic variation of the process Y observed with 
lag 5 over a time interval [0, r]. We refer to this quantity as RRQV (for realised 
relative quadratic volatility). 

As mentioned in Example 1, in case g is the gamma kernel (8) with v g (1/2, 1)U 
(1,3/2] then the BSS process (7) is not a semimartingale. In particular, if v g 
( 1/2, 1) — the case of most interest for the study of turbulence — the realised quadratic 
variation [Ys]t does not converge as it would if Y was a semimartingale; in fact it 
diverges to infinity whereas in the semimartingale case it will generally converge to 
the accumulated volatility/intermittency 

(rjds, (21) 

which is an object of key interest (in turbulence it represents the coarse-grained 
energy dissipation). However the situation can be remedied by adjusting [Ys]t by a 
factor depending on v; in wide generality it holds that 

ay (22) 

as8 0,withx = A.-i22(i’-i/2)(r(p) + r(p + i/2))/r(2y-l)r(3/2-l).To 
apply this requires knowledge of the value of v and in general v must be estimated with 
sufficient precision to ensure that substituting the estimate for the theoretical value of 
V in (22) will still yield convergence in probability. Under relatively mild conditions 
that is possible, as discussed in [26] and the references therein. An important aspect 
of formula (20) is that its use does not involve knowledge of v as the adjustment 
factor cancels out (Fig. 3). 

Convergence in probability and a central limit theorem for the RRQV is estab- 
lished in [ 1 0] . Figure 2 illustrates its use, for two sections of the “Brookhaven” dataset. 
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t (s) t (s) 

Fig. 3 Brookhaven turbulence data periods 18 and 25 — RRQV and 95 % confidence intervals 



one where the volatility effect was deemed by eye to be very small and one where 
it appeared strong. (The “Brookhaven” dataset consist of 20 million one-point mea- 
surements of the longitudinal component of the wind velocity in the atmospheric 
boundary layer, 35 m above ground. The measurements were performed using a 
hot-wire anemometer and sampled at 5 kHz. The time series can be assumed to be 
stationary. We refer to [27] for further details on the dataset; the dataset is called 
no. 3 therein). 



4.3 Role of Selfdecomposability 

If T is given by (12) it is automatically infinitely divisible, and selfdecomposable 
provided L has that property; whereas if r is defined by (13) it will only in exceptional 
cases be infinitely divisible. 

A non-trivial example of such an exceptional case is the following. The Gumbel 
distribution with density 



1 ( X — a { X — a\\ 

/W = ^exp(— -exp(— )), (23) 

where a G M and Z? > 0 is infinitely divisible [39]. In [31] it was demonstrated that 
the one-dimensional marginal distribution of the logarithm of the energy dissipation 
is accurately described by a normal inverse Gaussian distribution. One may also 
show (not done here) that the Gumbel distribution with h = 2 provides another fit 
that is nearly as accurate as the normal inverse Gaussian. Furthermore, if X is a 
Gumbel random variable with h = 2, then exp(X) is distributed as the square of 
an exponential random variable, hence also infinitely divisible by [39]. Therefore, if 
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the Levy basis L in (17) is chosen so that L{A) follows a Gumbel distribution with 
b = 2, then exp(L(A(0)) will be infinitely divisible. 

For a general discussion of selfdecomposable fields we refer to [13]. See also [32] 
which provides a survey of when a selfdecomposable random variable can be rep- 
resented as a stochastic integral, like in (12). Representations of that kind allow, in 
particular, the construction of field- valued processes of OU or supOU type that may 
be viewed as propagating, in time, an initial volatility/intermittency field defined on 
the spatial component of space-time for a fixed time, say t = 0. Similarly, suppose 
that a model has been formulated for the time-wise development of a stochastic field 
at a single point in space. One may then seek to define a field on space-time such that 
at every other point of space the time- wise development of the field is stochastically 
the same as at the original space point and such that the field as a whole is stationary 
and selfdecomposable. 

Example 1 {One dimensional turbulence) Let Y denote an ambit field in the tempo- 
spatial case where the spatial dimension is 1 , and assume that for a preliminary purely 
temporal model X of the same turbulent phenomenon a model has been formulated 
for the squared amplitude volatility component, say co. It may then be desirable 
to devise Y such that the volatility/intermittency field r = is stationary and 
infinitely divisible, and such that for every spatial position x the law of r (x , • ) is 
identical to that formulated for the temporal setting, i.e. co. If the temporal process 
is selfdecomposable then, subject to a further weak condition (see [13]), such a field 
can be constructed. 

To sketch how this may proceed, recall first that the classical definition of self- 
decomposability of a process X says that all the finite-dimensional marginal distri- 
butions of X should be selfdecomposable. Accordingly, due to a result by [38], for 
any finite set of time points it = (wi, . . . , w„) the selfdecomposable vector variable 
X{u) = {X{u\), . . . , X{un)) has a representation 



for some ^-dimensional Levy process L( • , w), provided only that the Levy measure 
of X{u) has finite log-moment. We now assume this to be the case and that X is 
stationary 

Next, for fixed iJ, let {L(x, iJ) | x G M}, be the ^-dimensional Levy process having 
the property that the law of L(1 , w) is equal to the law of X(u). Then the integral 



exists and the process {X(x,u) \ x G M} will be stationary — of Ornstein-Uhlenbeck 
type — while for each x the law of X{x,u) will be the same as that of X{u). 
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However, off hand the Levy processes L( • , w) corresponding to different sets u 
of time points may have no dynamic relationship to each other, while the aim is to 
obtain a stationary selfdecomposable field X(x,t) such that X(x, • ) has the same 
law as X for all v g M. But, arguing along the lines of theorem 3.4 in [9], it is possible 
to choose the representative processes L( • , w) so that they are all defined on a single 
probability space and are consistent among themselves (in analogy to Kolmogorov’s 
consistency result); and that establishes the existence of the desired field X(x,t). 
Moreover, X(- , •) is selfdecomposable, as is simple to verify. 

The same result can be shown more directly using master Levy measures and the 
associated Levy-Ito representations, cf. 113]. 

Example 2 Assume that X has the form 



where L is a Levy process. 

It has been shown in 113] that, in this case, provided g is integrable with respect 
to the Lebesgue measure, as well as to L, and if the Fourier transform of g is non- 
vanishing, then X, as a process, is selfdecomposable if and only if L is selfdecompos- 
able. When that holds we may, as above, construct a selfdecomposable field X{x,t) 
with X(x, • ) "^ ^ ( • ) for every v G M and X(' ,t) of OU type for every r g M. 

As an illustration, suppose that g is the gamma kernel (8) with v G (1/2, 1). Then 
the Fourier transform of g is 



and hence, provided that L is such that the integral (24) exists, the field X(x,u) is 
stationary and selfdecomposable, and has the OU type character described above. 



5 Time Change and Universality in Turbulence 
and Finance 

5.1 Distributional Collapse 

In 14], Barndorff-Nielsen et al. demonstrate two properties of the distributions of 
increments AgX {t) = X{t) — X{t — 1) of turbulent velocities. Firstly, the increment 
distributions are parsimonious, i.e., they are described well by a distribution with 
few parameters, even across distinct experiments. Specifically it is shown that the 




(24) 



g(0 = (i-/?/k)-^ 
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four-parameter family of normal inverse Gaussian distributions (NIG(ck:, 11,8)) 
provides excellent fits across a wide range of lags i, 

AtX - NIG(a(f), ^(f), /x(£), 8(1)). (25) 

Secondly, the increment distributions are universal, i.e., the distributions are the same 
for distinct experiments, if just the scale parameters agree, 

Ai,Xi - if and only if 81(h) = 82(12). (26) 

provided the original velocities (not increments) have been non-dimensionalized 
by standardizing to zero mean and unit variance. Motivated by this, the notion of 
stochastic equivalence class is introduced. 

The line of study initiated in [4] is continued in [ 1 6] , where the analysis is extended 
to many more data sets, and it is observed that 

Ag^Xi ~ Ai^X2 if and only if var(A£^ Ai) ~ \2ir(Ai^X2), (27) 

which is a simpler statement than (26), since it does not involve any specific distri- 
bution. In [21], Bamdorff-Nielsen et al. extend the analysis from fluid velocities in 
turbulence to currency and metal returns in finance and demonstrate that (27) holds 
when Xi denotes the log-price, so increments are log-returns. Further corrobora- 
tion of the existence of this phenomenon in finance is presented in the following 
subsection. 

A conclusion from the cited works is that within the context of turbulence or 
finance there exists a family of distributions such that for many distinct experiments 
and a wide range of lags, the corresponding increments are distributed according 
to a member of this family. Moreover, this member is uniquely determined by the 
variance of the increments. 

Up till recently these stylised features had not been given any theoretical back- 
ground. However, in [20], a class of stochastic processes is introduced that exactly 
has the rescaling property in question. 



5.2 A First Look at Financial Data from SP500 

Motivated by the developments discussed in the previous subsection, in the following 
we complement the analyses in [4, 16, 21] with 29 assets from Standard & Poor’s 
500 stock market index. The following assets were selected for study: AA, AIG, 
AXP, BA, BAG, C, CAT, CVX, DD, DIS, GE, GM, HD, IBM, INTC, JNJ, JPM, KO, 
MCD, MMM, MRK, MSFT, PG, SPY, T, UTX, VZ, WMT, XOM. For each asset, 
between 7 and 12 years of data is available. A sample time series of the log-price of 
asset C is displayed in Fig. 4, where the thin vertical line marks the day 2008-01-01. 
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asset C 




Fig. 4 Time series of the log-price of asset C on an arbitrary scale. The thin vertical line marks the 
day 2008-01-01 and divides the dataset into the two subsets “pre” (blue) and “post” (yellow) 



Asset C is found to be representative of the feature of all the other datasets. Each 
dataset is divided into two subsets: the “pre” subset consisting of data from before 
2008-01-01 and the “post” subset consisting of data from after 2008-01-01. This 
subdivision was chosen since the volatility in the “post” dataset is visibly higher 
than in the “pre” dataset, presumably due to the financial crisis. The data has been 
provided by Lunde (Aarhus University), see also [29]. 

Figure 5 shows that the distributions of log-returns across a wide range of lags 
ranging from 1 s to approximately 4.5 h are quite accurately described by normal 



C, lag = 1 



C, lag = 4 



C, lag = 16 




C, lag = 32 



C, lag = 128 



] = 512 






C, lag = 1024 



C, lag = 4096 



C, lag = 16384 






Fig. 5 Probability densities on a log-scale for the log-returns of asset C at various lags ranging from 
1 to 16384 s. The dots denote the data and the solid line denotes the fitted NIG distribution. Blue 
dindyellow denote the “pre” and “post” datasets, respectively. The log-returns have been multiplied 
with 100 in order to un-clutter the labeling of the x-axes 
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inverse Gaussian distributions, except at the smallest lags where the empirical dis- 
tributions are irregular. We suspect this is due to market microstructure noise. The 
accuracy of the fits is not surprising given that numerous publications have demon- 
strated the applicability of the generalised hyperbolic distribution, in particular the 
subfamily consisting of the normal inverse Gaussian distributions, to describe finan- 
cial datasets. See for example [1, 14, 15, 28, 40]. We note the transition from a 
highly peaked distribution towards the Gaussian as the lag increases. 

Next, we see on Fig. 6 that the distributions at the same lag of the log-returns for 
the 29 assets are quite different, that is, they do not collapse onto the same curve. This 
holds for both the “pre” and the “post” datasets. However, the transition from a highly 
peaked distribution at small lags towards a Gaussian at large lags hints that a suitable 
change of time, though highly nonlinear, may cause such a collapse. Motivated by 
the observations in [21] we therefore consider the variance of the log-returns as a 
function of the lag. Figure 7 shows how the variance depends on the lag. Except at the 
smallest lags, a clear power law is observed. The behaviour at the smallest lags is due 
to market microstructure noise [29] . Nine variances have been selected to represent 
most of the variances observed in the 29 assets. For each selected variance and each 
asset the corresponding lag is computed. We note that for the smallest lags/variances 
this is not without difficulty since for some of the assets the slope approaches zero. 

Finally, Fig. 8 displays the distributions of log-returns where the lag for each 
asset has been chosen such that the variance is the one specified in each subplot. The 
difference between Figs. 6 and 8 is pronounced. We see that for both the “pre” and 
the “post” dataset, the distributions corresponding to the same variance tend to be 
the same. Furthermore, when the “pre” and “post” datasets are displayed together, 
essentially overlaying the top part of Fig. 8 with the bottom part, a decent overlap is 
still observed. So while the distributions in Fig. 8 do not collapse perfectly onto the 
same curve for all the chosen variances, in contrast to what is the case for velocity 
increments in turbulence (see [4]), we are invariably led to the preliminary conclusion 
that also in the case of the analysed assets from S&P500, a family of distributions 
exists such that all distributions of log-returns are members of this family and such 
that the variance of the log-returns uniquely determines this member. The lack of 
collapse at the smaller variances may in part be explained by the difficulty in reading 
off the corresponding lags. 

The observed parsimony and in particular universality has implications for mod- 
elling since any proper model should possess both features. Within the context of 
turbulence, BSS-processes have been shown to be able to reproduce many key fea- 
tures of turbulence, see [35] and the following subsection for a recent example. 
The extent to which BSS-processes in general possess universality is still ongoing 
research [20] but results indicate that BSS-processes and in general LSS-process are 
good candidates for models where parsimony and universality are desired features. 
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Fig. 6 Probability densities on a log-scale for the log-returns of all 29 assets at various lags ranging 
from 1 to 16384s. The top and bottom halfs represent the “pre” and “post” datasets, respectively. 
The log-returns have been multiplied with 100 in order to un-clutter the labeling of the x-axes 
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variance vs. lag 




Fig. 7 The variance of the log-retums for the 29 assets as a function of the lag displayed in 
a double logarithmic representation. The top and bottom graphs represent the “pre” and “post” 
datasets, respectively 



5.3 Modelling Turbulent Velocity Time Series 



A specific time- wise version of (1), called Brownian semi stationary processes has 
been proposed in [18, 19] as a model for turbulent velocity time series. It was shown 
that BSS processes in combination with continuous cascade models (exponentials of 
certain trawl processes) are able to qualitatively capture some main stylized features 
of turbulent time series. 
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Fig. 8 Probability densities on a log-scale of log-returns where the lag for each asset has been 
chosen such that the variances of the assets in each subplot is the same. The chosen variances are 
also displayed in Fig. 7 as horizonal lines. For the smallest and largest variances, not all dataset are 
present since for some datasets those variances are not attained. Top “pre”, bottom “post” 
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Recently this analysis has been extended to a quantitative comparison with turbu- 
lent data [31, 35]. More specifically, based on the results for the energy dissipation 
oulined in Sect. 4.1, BSS processes have been analyzed and compared in detail to 
turbulent velocity time series in [35] by directly estimating the model parameters 
from data. Here we briefly summarize this analysis. 

Time series of the main component Vt of the turbulent velocity field are modelled 
as a BSS process of the specific form 

v(t) = v(f, g,a, 13) = [ g{t - s)G{s)B{ds)3- P f g(t - s)a(s)^ ds =: R(t) l3S(t) 

J—oo J—oo 

(28) 

where g is a non-negative L^(M+) function, a is a stationary process independent 
of 5, is a constant and B denotes standard Brownian motion. An argument based 
on quadratic variation shows that when g(O-h) ^ 0, then cr^ can be identified with 
the surrogate energy dissipation, cr^ = s, where s is the process given by (17). The 
kernel g is specified as a slightly shifted convolution of gamma kernels [30], 

g{t) = go(^ + ^o), 

go (0 = exp(-A.20l^l(vi, VI -h V2, (A.2 - A.i)01(0,oo)(0 

with a > 0, Vi > 0 and A./ > 0. Here i F\ denotes the Kummer confluent hypergeo- 
metric function. The shift is needed to ensure that g(O-h) / 0. 

The data set analysed consists of one-point time records of the longitudinal (along 
the mean flow) velocity component in a gaseous helium jet flow with a Taylor 
Reynolds number Rx = 985. The same data set is also analyzed in [31] and the 
estimated parameters there are used to specify = sm (28). The remaining para- 
meters for the kernel g and the constant ^ can then be estimated from the second 
and third order structure function, that is, the second and third order moments of 
velocity increments. In [35] it is shown that the second order structure function is 
excellently reproduced and that the details of the third order structure function are 
well captured. It is important to note that the model is completely specified from the 
energy dissipation statistics and the second and third order structure functions. 

The estimated model for the velocity is then succesfully compared with other 
derived quantities, including higher order structure functions, the distributions of 
velocity increments and their evolution as a function of lag, the so-called Kolmogorov 
variable and the energy dissipation, as prediced by the model. 



6 Conclusion and Outlook 

The present paper highlights some of the most recent developments in the theory 
and applications of Ambit Stochastics. In particular, we have discussed the existence 
of the ambit fields driven by metatime changed Levy bases, selfdecomposability of 
random fields [13], applications of BSS processes in the modelling of turbulent time 
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series [35] and new results on the distributional collapse in financial data. Some of 
the topics not mentioned here but also under development are the integration theory 
with respect to time-changed volatility modulated Levy bases [7]; integration with 
respect to volatility Gaussian processes in the White Noise Analysis setting in the 
spirit of [34] and extending [6]; modelling of multidimensional turbulence based 
on ambit fields; and in-depth study of parsimony and universality in BSS and LSS 
processes motivated by some of the discussions in the present paper. 

Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 
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Abstract First, we revisit basic theory of functional Ito/path-dependent calculus, 
using the formulation of calculus via regularization. Relations with the corresponding 
Banach space valued calculus are explored. The second part of the paper is devoted 
to the study of the Kolmogorov type equation associated with the so called window 
Brownian motion, called path-dependent heat equation, for which well-posedness at 
the level of strict solutions is established. Then, a notion of strong approximating 
solution, called strong-viscosity solution, is introduced which is supposed to be a 
substitution tool to the viscosity solution. For that kind of solution, we also prove 
existence and uniqueness. 
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1 Introduction 



The present work collects several results obtained in the papers [9, 10], focusing 
on the study of some specific examples and particular cases, for which an ad hoc 
analysis is developed. This work is an improved version of [8], trying to explain more 
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precisely some details. For example, in [8] a slightly more restrictive definition of 
strong- viscosity solution was adopted, see Remark 12. 

Recently, a new branch of stochastic calculus has appeared, known functional 
ltd calculus, which results to be an extension of classical Ito calculus to functionals 
depending on the entire path of a stochastic process and not only on its current value, 
see Dupire [17], Cont and Fournie [5-7]. Independently, Di Girolami and Russo, and 
more recently Fabbri, Di Girolami, and Russo, have introduced a stochastic calculus 
via regularizations for processes taking values in a separable Banach space B (see 
[12-16]), including the case B = C([— T, 0]), which concerns the applications to 
the path-dependent calculus. 

In the first part of the present paper, we follow [9] and revisit functional Ito cal- 
culus by means of stochastic calculus via regularization. We recall that Cont and 
Fournie [5-7] developed functional Ito calculus and derived a functional Ito’s for- 
mula using discretization techniques of Fohmer [23] type, instead of regularization 
techniques, which in our opinion, better fit to the notion of derivative. Let us illus- 
trate another difference with respect to [5]. One of the main issues of functional Ito 
calculus is the definition of the functional (or path wise) derivatives, i.e., the hori- 
zontal derivative (calling in only the past values of the trajectory) and the vertical 
derivative (calling in only the present value of the trajectory). In [5], it is essential 
to consider functionals defined on the space of cMlag trajectories, since the def- 
inition of functional derivatives necessitates of discontinuous paths. Therefore, if 
a functional is defined only on the space of continuous trajectories (because, e.g., 
it depends on the paths of a continuous process as Brownian motion), we have to 
extend it anyway to the space of cMlag trajectories, even though, in general, there is 
no unique way to extend it. In contrast to this approach, we introduce a new space 
larger than the space of continuous trajectories C([— T, 0]), denoted by "^([— T, 0]), 
which allows us to define functional derivatives. "^([—T, 0]) is the space of bounded 
trajectories on [— T, 0], continuous on [— T, 0[ and with possibly a jump at 0. We 
endow "^([—T, 0]) with a topology such that C([—T, 0])is dense in "^([—T, 0]) with 
respect to this topology. Therefore, any functional ^ : [0, T] x C([— T, 0]) ^ M, 
continuous with respect to the topology of "^([— T, 0]), admits a unique extension 
to "^([— r, 0]), denoted u : [0, T] x "^([— T, 0]) ^ M. We present some significant 
functionals for which a continuous extension exists. Then, we develop the functional 
Ito calculus for u : [0, T] x "^([— T, 0]) ^ R. 

Notice that we use a slightly different notation compared with [5]. In partic- 
ular, in place of a map ^ : [0, T] x C([— T, 0]) ^ R, in [5] a family of maps 
F = (Ft)te[0,T], with G • ^*([0, i]) is considered. However, we can always 

move from one formulation to the other. Indeed, given F = (Ft)te[0,T], where each 
Ft : C([0, t]) R, we can define ^ : [0, T] x C([—T, 0]) ^ R as follows: 

^(t, T]) := G(^(- + T)|[o,r]), (L ^) G [0, T] x C([-T, 0]). 
Vice-versa, let ^ : [0, T] x C([—T, 0]) ^ R and define F = (Ft)te[0,T] as 



G(i?) := ^(L r]), (t, i)) ^ [0, T] x C([0, G), 



( 1 ) 
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where rj is the element of C([— T, 0]) obtained from rf firstly translating ^ on the 
interval [— C 0], then extending it in a constant way up to —T, namely r](x) := 
rj(x + ^ [“^5 0]- Observe that, in 

principle, the map ^ contains more information than F, since in (1) we do not take 
into account the values of at (C c [0, T] x C([— T, 0]) with t] not constant 
on the interval [-T, —t]. Despite this, the equivalence between the two notations 
is guaranteed; indeed, when we consider the composition of ^ with a stochastic 
process, this extra information plays no role. Our formulation has two advantages. 
Firstly, we can work with a single map instead of a family of maps. In addition, 
the time variable and the path have two distinct roles in our setting, as for the time 
variable and the space variable in the classical Ito calculus. This, in particular, allows 
us to define the horizontal derivative independently of the time derivative, so that, 
the horizontal derivative defined in [5] corresponds to the sum of our horizontal 
derivative and of the time derivative. We mention that an alternative approach to 
functional derivatives was introduced in [1]. 

We end the first part of the paper showing how our functional Ito’s formula is 
strictly related to the Ito’s formula derived in the framework of Banach space valued 
stochastic calculus via regularization, for the case of window processes. This new 
branch of stochastic calculus has been recently conceived and developed in many 
directions in [12, 14-16] ; for more details see [13] . For the particular case of window 
processes, we also refer to Theorem 6.3 and Sect. 7.2 in [12]. In the present paper, we 
prove formulae which allow to express functional derivatives in terms of differential 
operators arising in the Banach space valued stochastic calculus via regularization, 
with the aim of identifying the building blocks of our functional Ito’s formula with 
the terms appearing in the Ito’s formula for window processes. 

Dupire [17] introduced also the concept of path-dependent partial differential 
equation, to which the second part of the present paper is devoted. Di Girolami and 
Russo, in Chap. 9 of [13], considered existence of regular solutions associated with 
a path dependent heat equation (which is indeed the Kolmogorov equation related to 
window Brownian motion) with a Frechet smooth final condition. This was performed 
in the framework of Banach space valued calculus, for which we refer also to [22]. 
A flavour of the notion of regular solution in the Banach space framework, appeared 
in Chap. IV of [30] which introduced the notion of weak infinitesimal generator (in 
some weak sense) of the window Brownian motion and more general solutions of 
functional dependent stochastic differential equations. Indeed, the monograph [30] 
by Mohammed constitutes an excellent early contribution to the theory of this kind 
of equations. 

We focus on semilinear parabolic path-dependent equations associated to the 
window Brownian motion. For more general equations we refer to [9] (for strict 
solutions) and to [10] (for strong- viscosity solutions). First, we consider regular 
solution, which we call strict solutions, in the framework of functional Ito calculus. 
We prove a uniqueness result for this kind of solution, showing that, if a strict solution 
exists, then it can be expressed through the unique solution to a certain backward 
stochastic differential equation (BSDE). Then, we prove an existence result for strict 
solutions. 
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However, this notion of solution turns out to be unsuitable to deal with all signif- 
icant examples. As a matter of fact, if we consider the path-dependent PDE arising 
in the hedging problem of lookback contingent claims, we can not expect too much 
regularity of the solution (this example is studied in detail in Sect. 3.2). Therefore, 
we are led to consider a weaker notion of solution. In particular, we are interested in a 
viscosity-type solution, namely a solution which is not required to be differentiable. 

The issue of providing a suitable definition of viscosity solutions for path- 
dependent PDEs has attracted a great interest, see Peng [33] and Tang and Zhang 
[42], Ekren et al. [18-20], Ren et al. [34]. In particular, the definition of viscos- 
ity solution provided by [18-20, 34] is characterized by the fact that the classical 
minimum/maximum property, which appears in the standard definition of viscosity 
solution, is replaced with an optimal stopping problem under nonlinear expecta- 
tion [21]. Then, probability plays an essential role in this latter definition, which can 
not be considered as a purely analytic object as the classical definition of viscosity 
solution is; it is, more properly, a probabilistic version of the classical definition of 
viscosity solution. We also emphasize that a similar notion of solution, called sto- 
chastic weak solution, has been introduced in the recent paper [29] in the context of 
variational inequalities for the Snell envelope associated to a non-Markovian con- 
tinuous process X. Those authors also revisit functional Ito calculus, making use of 
stopping times. This approach seems very promising. Instead, our aim is to provide a 
definition of viscosity type solution, which has the peculiarity to be a purely analytic 
object; this will be called a strong-viscosity solution to distinguish it from the clas- 
sical notion of viscosity solution. A strong-viscosity solution to a path-dependent 
partial differential equation is defined, in a few words, as the pointwise limit of strict 
solutions to perturbed equations. We notice that the definition of strong- viscosity 
solution is similar in spirit to the vanishing viscosity method, which represents one 
of the primitive ideas leading to the conception of the modem definition of viscosity 
solution. Moreover, it has also some similarities with the definition of good solution, 
which turned out to be equivalent to the definition of -viscosity solution for certain 
fully nonlinear partial differential equations, see, e.g., [3, 11, 27, 28]. Einally, our 
definition is likewise inspired by the notion of strong solution (which justifies the first 
word in the name of our solution), as defined for example in [2, 24, 25], even though 
strong solutions are required to be more regular (this regularity is usually required 
to prove uniqueness of strong solutions, which for example in [24, 25] is based on 
a Eukushima-Dirichlet decomposition). Instead, our definition of strong- viscosity 
solution to the path-dependent semilinear Kolmogorov equation is not required to 
be continuous, as in the spirit of viscosity solutions. The term viscosity in the name 
of our solution is also justified by the fact that in the finite dimensional case we 
have an equivalence result between the notion of strong-viscosity solution and that 
of viscosity solution, see Theorem 3.7 in [8]. We prove a uniqueness theorem for 
strong-viscosity solutions using the theory of backward stochastic differential equa- 
tions and we provide an existence result. We refer to [10] for more general results 
(when the path-dependent equation is not the path-dependent heat equation) and 
also for the application of strong- viscosity solutions to standard semilinear parabolic 
PDEs. 
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The paper is organized as follows. In Sect. 2 we develop functional Ito calculus via 
regularization following [9] : after a brief introduction on finite dimensional stochastic 
calculus via regularization in Sect. 2. 1 , we introduce and study the space "^([—7,0]) 
in Sect. 2.2; then, we define the pathwise derivatives and we prove the functional 
Ito’s formula in Sect. 2.3; in Sect. 2.4, instead, we discuss the relation between func- 
tional Ito calculus via regularization and Banach space valued stochastic calculus 
for window processes. In Sect. 3, on the other hand, we study path-dependent PDEs 
following [10]. More precisely, in Sect. 3.1 we discuss strict solutions; in Sect. 3.2 
we present a significant hedging example to motivate the introduction of a weaker 
notion of solution; finally, in Sect. 3.3 we provide the definition of strong-viscosity 
solution. 



2 Functional Ito Calculus: A Regularization Approach 

2.1 Background: Finite Dimensional Calculus 
via Regularization 

The theory of stochastic calculus via regularization has been developed in several 
papers, starting from [37, 38]. We recall below only the results used in the present 
paper, and we refer to [40] for a survey on the subject. We emphasize that integrands 
are allowed to be anticipating. Moreover, the integration theory and calculus appear 
to be close to a pure pathwise approach even though there is still a probability space 
behind. 

Fix a probability space (f2, P) and T g]0, oo[. Let F = denote a 

filtration satisfying the usual conditions. Let X = (2f^)^e[0,r] (resp. Y = (Tr)r€[0,r]) 
be a real continuous (resp. F-a.s. integrable) process. Every real continuous process 
X = (2f^)^e[0,r] is naturally extended to all r G M setting Xt = Xq, ^ < 0, and 
Xt = Xt, t >T. We also define a C([—T, 0])-valued process X = (X^)^^]^, called 
the window process associated with X, defined by 



This corresponds to the so-called segment process which appears for instance in [43] . 
Definition 1 Suppose that, for every t e [0, T], the limit 



exists in probability. If the obtained random function admits a continuous modifi- 
cation, that process is denoted by JqYcI~X and called forward integral of Y with 
respect to X. 



Xt:={Xt+^,xe[-T,0]}, t eR. 



( 2 ) 




( 3 ) 



32 



A. Cosso and F. Russo 



Definition 2 A family of processes )te[0,T] is said to converge to 
in the ucp sense, if supQ<^< 7 ^ 1^/^^ — Hf \ goes to 0 in probability, as £ ^ 0+. 

Proposition 1 Suppose that the limit (3) exists in the ucp sense. Then, the forward 
integral j^Yd~X of Y with respect to X exists. 

Let us introduce the concept of covariation, which is a crucial notion in stochastic 
calculus via regularization. Let us suppose that X, Y are continuous processes. 

Definition 3 The covariation of X and Y is defined by 

[X,F], = [F,A], = lim - [\xs+s-Xs){Ys+s-Ys)ds, r G [0, T], 

£-> 0 + S Jo 

if the limit exists in probability for every t e [0,T], provided that the limiting random 
function admits a continuous version (this is the case if the limit holds in the ucp 
sense). If X = Y, X is said to be a finite quadratic variation process and we set 
[A] := [X, X]. 

The forward integral and the covariation generalize the classical Ito integral and 
covariation for semimartingales. In particular, we have the following result, for a 
proof we refer to, e.g., [40]. 

Proposition 2 The following properties hold. 

(i) Let S^, S'^ be continuous ¥ -semimartingales. Then, [5^, is the classical 
bracket [5^, = (M^, M^), where (resp. M^) is the local martingale 

part of (resp. S^). 

( a ) Let V be a continuous bounded variation process and Y be a cddldg process 
(or vice-versa); then [V] = [Y, V] = 0. Moreover Yd~V = /q YdV, is the 

Lebesgue-Stieltjes integral. 

( Hi ) If W is a Brownian motion and Y is an F -progressively measurable process such 
that Y^ds < 00, ¥-a.s., then fQYd~W exists and equals the ltd integral 
fo YdW. 

We could have defined the forward integral using limits of non-anticipating Riemann 
sums. Another reason to use the regularization procedure is due to the fact that it 
extends the Ito integral, as Proposition 2(iii) shows. If the integrand had uncountable 
jumps (as Y being the indicator function of the rational number in [0, 1]) then, 
the Ito integral f^YdW would be zero F = 0 a.e. The limit of Riemann sums 
X/ (Wti_^i — Wti) would heavily depend on the discretization grid. 

We end this crash introduction to finite dimensional stochastic calculus via regu- 
larization presenting one of its cornerstones: Ito’s formula. It is a well-known result in 
the theory of semimartingales, but it also extends to the framework of finite quadratic 
variation processes. For a proof we refer to Theorem 2.1 of [39]. 
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Theorem 1 Let F : [0, T] x M — ^ M be of class ([0, T] x R) and X = 
(Xt)te[0,T] a real continuous finite quadratic variation process. Then, the follow- 
ing ltd’s formula holds, ^-a.s.. 



+ 



-f 

2 Jo 



I 



Fit, X,) = F(0, Xo) + / dtF(s,Xs)ds + / d,F(s, x,)d-x. 



f 



d^^F(s,X,)d[X], 



0<t <T. 



(4) 



2.1.1 The Deterministic Calculus via Regularization 



A useful particular case of finite dimensional stochastic calculus via regularization 
arises when ^2 is a singleton, i.e., when the calculus becomes deterministic. In addi- 
tion, in this deterministic framework we will make use of the definite integral on 
an interval [a,b], where a < b are two real numbers. Typically, we will consider 
a = —T or a = —t and b = 0. 

We start with two conventions. By default, every bounded variation function 
/: [a,b]^ Rwillbeconsideredascadlag. Moreover, given a function/: [a,b] 

R, we will consider the following two extensions of / to the entire real line: 



fj(x) := 



0 , 

fix), 

fici), 



X > b, 

X G [a, b], 
X < a, 



fjix) := 



fib), 

fix), 

0 , 



X > b, 

X G [a, b], 
X < a, 



where J := ]a, b] and J = [a, b[. 

Definition 4 Let f, g: [< 2 , Z?] ^ R be cadlag functions, 
(i) Suppose that the following limit 



f f fji^ F s) — fjis) 

/ g(s)d-f(s) := lim / gj(s)^ ^ ds, (5) 

J[a,b] ^^ 0 + Jr ^ 



exists and it is finite. Then, the obtained quantity is denoted by gd~ f and called 

ideterministiCy definite) forward integral of g with respect to f ion [a,b]). 

(ii) Suppose that the following limit 



/ 

J\a 



gis)d^f{s) 

M 



hm / gj(s)^ ^ ds, 

£-> 0 + 6 



( 6 ) 



exists and it is finite. Then, the obtained quantity is denoted by gd^ f and called 

(deterministic, definite) backward integral of g with respect to / (on [a, b]). 
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The notation concerning this integral is justified since when the integrator / has 
bounded variation the previous integrals are Lebesgue-Stieltjes integrals on [a,b]. 

Proposition 3 Suppose / : [<2, ^ M with bounded variation and g\ [<2, ^ M 

cddldg. Then, we have 



f g(s)d f{s) = [ g{s )df{s) := g(a)f(a) + [ g{s )df{s), (7) 

J[a,b] J[a,b] J]a,b] 



[ g(s)d~'~f{s) = f g{s)df(s) := g{a)f{a) + ( g{s)df{s). 

J[aM J[aM J]aM 



( 8 ) 



Proof Identity (7). We have 



/ 8j(s)— ds = -g(a) / f(s+s)ds 

, P ^ J((s+s)Ab)-f{s)^ 

+ / g(s) ds. (9) 

Ja e 

The second integral on the right-hand side of (9) gives, by Fubini’s theorem, 

[ giv(- [ df{y)\ds= [ (- [ 

Ja J]s,{s+s)Ab] / J]a,b] J [civiy—£),y] / 

[ g{y~)df{y). 

J]a,b] 

The first integral on the right-hand side of (9) goes to g(a)f(a) as £ ^ 0+, so the 
result follows. 

Identity (8). We have 

f fj(^) fj(^ f{s) f(s — s) 

/ gj(s) ■’ ^ -ds = / g{s) ^^’ ’-ds 

Jm. ^ J a-\-e ^ 

Y ra-Ve 

+ - / g{s)f{s)ds. ( 10 ) 

£ Ja 



The second integral on the right-hand side of (10) goes to g(a)f(a) as £ ^ 0+. The 
first one equals 

f f df{y)\ds = [ (- [ g(s)ds\df(y) [ g(y)df(y), 

Ja+e 7 ] 5 — £, 5 '] / J]a,b] J]y,(y+s)Ab] / J]a,b] 

from which the claim follows. □ 
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Let us now introduce the deterministic covariation. 

Definition 5 Let f, g\ [a, b] R be continuous functions and suppose that 0 c 
[a, b]. The (deterministic) covariation of / and g (on [a, b]) is defined by 



if the limit exists and it is finite for every x G [a,b].lff = g,wQSCt[f] := [/, /] 
and it is called quadratic variation of / (on [a, b]). 

We notice that in Definition 5 the quadratic variation [/] is continuous on [a, b], 
since / is a continuous function. 

Remark 1 Notice that if / is a fixed Brownian path and g(s) = cp(s, f(s)), with 
(p G b] X R). Then g(s)d~ f(s) exists for almost all (with respect to 

the Wiener measure on C ([a, b])) Brownian paths /. This latter result can be shown 
using Theorem 2.1 in [26] (which implies that the deterministic bracket exists, for 
almost all Brownian paths /, and [/](*s') = s) and then applying Ito’s formula in 
Theorem 1 above, with R given by the Dirac delta at a Brownian path /. □ 

We conclude this subsection with an integration by parts formula for the deter- 
ministic forward and backward integrals. 

Proposition 4 Let / : [a, b] R be a cadlag function and g: [a, b] R be a 
bounded variation function. Then, the following integration by parts formulae hold: 



Proof Identity (11). The left-hand side of (11) is the limit, when e ^ 0^, of 

- / gis)f(s -r s)ds / g(s)f(s)ds + - / g(s)f(b)ds + - / g(a)f(s -T s)ds. 

^ Ja ^ Ja ^ Jb—8 ^ Ja—s 

This gives 

\ C 1 C \ C 1 /■“ 

- g{s-s)f{s)ds / g(s)f(s)ds+- g(s)f(b)ds + - g(a)f(s + s)ds 

^ J ^ J a ^ J b—e ^ Ja—s 





( 11 ) 



( 12 ) 





^ Ja—s 



g(s)-g(s-s) 1 /■“+® , 1 C 

f{s)ds / g(s)f(s)ds + - g(s)f(b)ds 

^ ^ J a ^ J b—s 



g(s) - g(s - s) 
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We see that 



^ Ja—^ 
Moreover, we have 

g(s) - g(s - s) 



1 f^’ 

s Jb- 

■f 



g{s)f{b)ds g{b )f{b). 



1 /■'* 1 £^o+ 

-J g{a)fis +s)ds - - I g{s)f{s)ds — > 0 . 



■/ 

Ja-\ 



f{s)ds = - f dsf(s)- [ dg{y) 
Ja+S ^ ^ 15 - — £, 5-1 



f 1 

- / dg(y)- / 
J]a,b] ^ Jy 



]s—s,s] 

I rbA{y+s) 



yv(a+s) 



f(s)ds 



> 0 + 



i 



]a,b[ 



dg(y)f(y). 



In conclusion, we find 
g(s)d-f(s) 



/ , 

J[a,b] 



= - [ dg(y)f(y) + (g(b) - gib-)) fib) + gib~)fib) 
J]aM 



]aM 



dgiy)fiy) + gib)fib). 



Identity (12). The left-hand side of (12) is given by the limit, as £ ^ 0+, of 



I If 

- / gis)fis)ds / 

£ Ja £ Ja 



I rb ^ rb 

- / gis)fis)ds 

£ Ja £ Ja 



g(s)f(s)ds / g(s)f(s - s)ds = - g(s)f(s)ds / g(s ^ s)f(s)ds 



■/ 



’ J., ^g(s + e) - g(s) , 1 /■* , 

f{s) ds + - g(s)f(s)ds 

e £ Jb—8 



The second integral on the right-hand side goes io gib ) fib ) as £ ^ 0+ . The first 
integral expression equals 



-/ 



^ + , 1 ^ r / , fr^Sib)-g{s) 

f]{s) ds + -f(a) g(s + s)ds+ f(s) ds 



> 0 + 



-/ , 

J]a,b] 



fis-)dgis) - fia)gia) + fia)gia) + igib) - gib-))fib~), 



taking into account identity (7). This gives us the result. 



□ 



2.2 The Spaces 0 ]) and 0 [) 

Let C([— r, 0]) denote the set of real continuous functions on [— T, 0], endowed with 
supremumnorm \\r]\\oo = sup^^[_7^ o] for any r] e C([— T, 0]). 
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Remark 2 We shall develop functional Ito calculus via regularization firstly for time- 
independent functionals ^ : C([— T, 0]) ^ M, since we aim at emphasizing that in 
our framework the time variable and the path play two distinct roles, as emphasized 
in the introduction. This, also, allows us to focus only on the definition of horizontal 
and vertical derivatives. Clearly, everything can be extended in an obvious way to 
the time-dependent case ^ : [0, T] x C([— T, 0]) ^ M, as we shall illustrate later. 

□ 

Consider a map ^ : C ([— T, 0]) ^ M. Our aim is to derive a functional Ito’s for- 
mula for . To do this, we are led to define the functional (i.e., horizontal and vertical) 

derivatives for ^ in the spirit of [5, 17] . Since the definition of functional derivatives 
necessitates of discontinuous paths, in [5] the idea is to consider functionals defined 
on the space of cadlag trajectories B([— T, 0]). However, we can not, in general, 
extend in a unique way a functional ^ defined on C([— T, 0]) to D([— T, 0]). Our 
idea, instead, is to consider a larger space than C([— T, 0]), denoted by "^([— T, 0]), 
which is the space of bounded trajectories on [— T, 0], continuous on [— T, 0[ and 
with possibly a jump at 0. We endow 0]) with a (inductive) topology such 

that C([— r, 0]) is dense in "^([— T, 0]) with respect to this topology. Therefore, if 
^ is continuous with respect to the topology oC^([— T, 0]), then it admits a unique 
continuous extension u : "^([— T, 0]) ^ M. 

Definition 6 We denote by "^([— T, 0]) the set of bounded functions ^ : [— T, 0] 
^ M such that rj is continuous on [-T, 0[, equipped with the topology we now 
describe. 

Convergence We endow ^([—T,0]) with a topology inducing the following con- 
vergence: (r]n)n converges to r] in 0]) as n tends to infinity if the following 

holds. 

(i) II I loo < C, for any n eN, for some positive constant C independent of n; 

(ii) sup^g^ l^nM “ ^(-^)l ^ 0 as ^ tends to infinity, for any compact set K C 
[-T, 0[; 

(iii) T]n (0) ^ ^ (0) as n tends to infinity. 

Topology For each compact K C [— T, 0[ define the seminorm px on^{[—T,0])hy 
PK(r]) = sup |^(v)| -h |yy(0)|, Vyy G ^([-r, 0]). 

xeK 

Let M > 0 and 0]) be the set of functions in 0]) which are 

bounded by M. Still denote px the restriction of px to ([— T, 0]) and consider 

the topology on 0]) induced by the collection of seminorms (pk)k- Then, 

we endow ^([—T,0]) with the smallest topology (inductive topology) turning all 
the inclusions (m ’ 0]) ^ into continuous maps. 
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Remark 3 (i) Notice that C([— T, 0]) is dense in "^([— T, 0]), when endowed with 
the topology of "^([— T, 0]). As a matter of fact, let r] g ^([—T, 0]) and define, for 
any n e N\{0}, 



^ X ~T < X < 

n(rj(0) — rj(—l/n))x + ^(0), —l/n < x <0. 

Then, we see that cpn e C([—T, 0]) and cpn ^ rj in "^([— T, 0]). 

Now, for any a G M define 

Ca([-T, 0]) := [t] g C([-r, 0]) : ^(0) = a}, 
^a([-T, 0]) := [r] G m-T, 0]) : ^(0) = aj. 



Then, C«([— T, 0]) is dense in %([— T, 0]) with respect to the topology of 

m-T, 0]). 

(ii) We provide two examples of functionals ^ : C([—T, 0]) ^ R, continuous with 
respect to the topology of "^([— T, 0]), and necessarily with respect to the topology 
of C([—r, 0]) (the proof is straightforward and not reported): 

(a) ^ (?]) = g(r](ti), . . . , rj(tn)), for all r] g C([-T, 0]), with -T < < - < 

tn <0 and g : R'^ ^ R continuous. 

(b) ^ ( t ]) = Q^(p(x)d~r](x), for nil T] g C([— T, 0]), with (^ : [0, T] ^ Ra 

cadlag bounded variation function. Concerning this example, keep in mind that, 
using the integration by parts formula, ^ (r]) admits the representation (11). 

(iii) Consider the functional ^ (r]) = sup^^[^_ 7 ^ qj t](x), for all t] G C([—T, 0]). It 
is obviously continuous, but it is not continuous with respect to the topology of 
^([—T, 0]). As a matter of fact, for any n eN consider % g C([—T, 0]) given by 



r]nM 



0 , 

on+l 
on+l 

— 



+ 2 , 



-T <x < 

_I1 < r < Z_ 

2n ^ X ^ 2«+i ’ 

< X < 0. 



Then, ^ (r]n) = sup^^[^_ 7 ^ qj r]n(x) = 1, for any n. However, r]n converges to the zero 
function in "^([— T, 0]), as ^ tends to infinity. This example will play an important 
role in Sect. 3 to justify a weaker notion of solution to the path-dependent semilinear 
Kolmogorov equation. □ 

To define the functional derivatives, we shall need to separate the “past” from the 
“present” of ^ g ^([—T, 0]). Indeed, roughly speaking, the horizontal derivative 
calls in the past values of r], namely {r](x): x G [— T,0[}, while the vertical derivative 
calls in the present value of r], namely r](0). To this end, it is useful to introduce the 
space ^([—T, 0[). 
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Definition 7 We denote by "^([— T, 0[) the set of bounded continuous functions 
y : [— r, 0 [ ^ M, equipped with the topology we now describe. 

Convergence We endow "^([— T, 0[) with a topology inducing the following con- 
vergence: (yn)n converges to y in ^([—T, 0 [) as n tends to infinity if: 

(i) sup^g[_ 7 ^ Q|^ \ YnM\ < C, for any n eN, for some positive constant C indepen- 
dent of n ; 

(ii) sup^g^ IVnM — y Ml 0 as n tends to infinity, for any compact set K C 
[-T, 0[. 

Topology For each compact K C [— T, 0[ define the seminorm ^ 7 ^ on "^([— T, 0[) by 
qK(y) = supIkWI, Wy e^([-T,0[). 

xeK 

Let M > 0 and 0[) be the set of functions in ^([—T, 0[) which are 

bounded by M. Still denote qx the restriction of qx to 0[) and consider 

the topology on 0[) induced by the collection of seminorms (q^K- Then, 

we endow ^([—T,0[) with the smallest topology (inductive topology) turning all 
the inclusions /m • 0[) ^ "^([— r, 0 [) into continuous maps. 

Remark 4 (i) Notice thaC^([— T, 0]) is isomorphic to "^([— T, 0[) x M. As a matter 
of fact, it is enough to consider the map 

J : ^([-r, 0]) ^ ^([-r, 0[) X M 

q \-> (q\[-T,0b ^(0))- 

Observe that J~^: ^([-r,0[) x M ^ ^([-7,0]) is given by J~\y,a) = 

yl[-r,o[ + <^1{0}- 

(ii) "^([— r, 0 ]) is a space which contains C([— T, 0 ]) as a dense subset and it has the 
property of separating “past” from “present”. Another space having the same property 
is L^([—T, 0]; dll) where /x is the sum of the Dirac measure at zero and Lebesgue 
measure. Similarly as for item (i), that space is isomorphic to L^([—T, 0]) x M, 
which is a very popular space appearing in the analysis of functional dependent (as 
delay) equations, starting from [4]. □ 

For every u : ^([—T, 0]) ^ M, we can now exploit the space 0[) to 

define a map u : 0 [) x M ^ M where “past” and “present” are separated. 

Definition 8 Let u : 0]) ^ M and define : "^([— T, 0[) x M ^ M as 

u(y,a) := u(yl[-T,o[ + al{ 0 }), V (y, < 2 ) G ^([-T, 0[) x R. (13) 
In particular, we have u(q) = u(q\[-T,o[, ^(0)), for all q G ^{[—T, 0]). 
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We conclude this subsection with a characterization of the dual spaces of 
"^([— r, 0]) and "^([— r, 0[), which has an independent interest. Firstly, we need 
to introduce the set ^([—T, 0]) of finite signed Borel measures on [— T, 0]. We also 
denote 0]) c ^([— T, 0]) the set of measures ii such that /x({0}) = 0. 

Propositions Let A e ^([—T,0]y, the dual space r>/'^([— T, 0]). Then, there 
exists a unique /x G 0]) such that 

At] = I T](x)fi(dx), Vt] 

Proof Let A g ^([-L, 0])* and define 

Acp \= A(p, 'i(p G c([-r,0]). 

Notice that A\ C{[—T, 0]) ^ M is a continuous functional on the Banach space 
C([— r, 0]) endowed with the supremum norm || • ||oo. Therefore A g C([— T, 0])* 
and it follows from Riesz representation theorem (see, e.g.. Theorem 6.19 in [36]) 
that there exists a unique /x G such that 

Acp = [ cp{x)ii{dx), G C([-r,0]). 

Obviously A is also continuous with respect to the topology of "^([— T, 0]). Since 
C([— r, 0]) is dense in "^([— T, 0]) with respect to the topology of "^([— T, 0]), we 
deduce that there exists a unique continuous extension of A to "^([—T, 0]), which is 
clearly given by 



At] = [ T](x)pi(dx), Wt] 



□ 



Proposition 6 Let A g 0[)*, the dual space of 0[). Then, there 

exists a unique pi G ^q{{—T, 0]) such that 

Ay = [ y(x)pi(dx), VyG^([-r,0[). 

Proof Let A g "^([— T, 0[)* and define 



Aq := xl(^|[_r,o[), V^G^([-r,0]). 



(14) 
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Notice that yi : 0]) ^ M is a continuous functional on "^ ([— T, 0]). It follows 

from Proposition 5 that there exists a unique /x G ^([— T, 0]) such that 

At] = [ T]{x)fi{dx) = [ f](x)fx{dx)-\-r]{0)fi{{0}), W r] e ^([-T,0]). 

J[-T,0] n-T,0[ 

(15) 

Let? 7 i, T ]2 c ^([—T, 0]) be such that l[_r, 0 [ = ^ 2 l[-r, 0 [- Then, we seefrom (14) 
that yi? 7 i = At] 2 , which in turn implies from (15) that /x({0}) = 0. In conclusion, 
fi G ^o([—T, 0]) and A is given by 

Ay = [ y(x)fji(dx), V y G ^([-7, 0[). 

J[-T,0[ 



2.3 Functional Derivatives and Functional Ito^s Formula 



In the present section we shall prove one of the main result of this section, namely 
the functional Ito’s formula for : C([— T, 0]) ^ M and, more generally, for 
^ : [0, T] X C([— r, 0]) ^ M. We begin introducing the functional derivatives, 
firstly for a functional u : "^([— T, 0]) ^ M, and then for ^ : C([— T, 0]) ^ M. 

Definition 9 Consider u : "^([— T, 0]) ^ M and t] g ^([—T, 0]). 

(i) We say that u admits the horizontal derivative at r] if the following limit exists 
and it is finite: 

i/(y7(01[-r,o[ + y?(Q)l{0|) - g)l[-r,o[ + y7(Q)l{0|) 

£-> 0 + s 

(16) 

(i)’ Let u be as in (13), then we say that u admits the horizontal derivative at 
(]/, a) G "^([— r, 0[) X M if the following limit exists and it is finite: 



D^u{y,a) := lim 

£-> 0 + 



u(y(-), a) — u(y(' — s), a) 



(17) 



Notice that if D^u(r]) exists then u(t]\[-t,o[, ^(0)) exists and they are equal; 
viceversa, if D^u(y, a) exists then D^w(}/I[_r, 0 [ + <^1{0}) exists and they are equal, 
(ii) We say that u admits the first-order vertical derivative at r] if the first-order 
partial derivative 9afi(^|[-r,0[, ^(0)) at ^(0)) ofii with respect to its second 

argument exists and we set 



D^u{r]) := 9aM(??|[-r,0[, »7(0)). 
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(iii) We say that u admits the second-order vertical derivative at r] if the second- 
order partial derivative aX ^(0)) of w with respect to its second argument, 

denoted by d^^u(r]\[-T,o[, ^(0)), exists and we set 

Definition 10 We say that u : ^([—T, 0]) ^ M is of class "^^’^(past x present) if 

(i) u is continuous; 

(ii) D^u exists everywhere on "^([— T, 0]) and for every y g "^([— T, 0[) the map 

(s, a) I — ^ D^u(y(’ — s), a), (s, a) G [0, oo[xM 

is continuous on [0, oo[x]R; 

(iii) D^u and exist everywhere on "^([— T, 0]) and are continuous. 

Remark 5 Notice that in Definition 10 we still obtain the same class of functions 
"^^’^(past X present) if we substitute point (ii) with 

(ii’) D^u exists everywhere on "^([— T, 0]) and for every y g ^([—T,0[) there 
exists 8(y) > 0 such that the map 

(£,a)\ — > u(y(- — s), a), (e, < 2 ) G [0, oo[xM (18) 

is continuous on [0, 5(}/)[xR. 

In particular, if (ii’) holds then we can always take 8(y) = oo for any y g 
^([—T,0[), which implies (ii). To prove this last statement, let us proceed by con- 
tradiction assuming that 

5*(}/) = sup {5(}/) > 0: the map (17) is continuous on [0, 5 (]/)[xIR} < oo. 

Notice that 5*(]/) is in fact a max, therefore the map (18) is continuous on 
[0, 5*(y)[xM. Now, define y(-) := ]/(• — 5*(y)). Then, by condition (ii’) there 
exists 8(y) > 0 such that the map 

(s, a) I — ^ D^u(y{- — s), a) = D^u(y{- — s — 5*(}/)), a) 

is continuous on [0, 5(}/)[x]R. This shows that the map (18) is continuous on 
[0, 5*(y) -h 5(]7)[xM, a contradiction with the definition of 5*(y). □ 

We can now provide the definition of functional derivatives for a map ^ : C ( [ — T , 0] ) 

^ M. 



Definition 11 Let ^ : C([—T, 0]) ^ M and r] g C([—T, 0]). Suppose that there 
exists a unique extension u : ^([—T, 0]) ^ M of (e.g., if ^ is continuous with 
respect to the topology of "^([— T, 0])). Then we define the following concepts. 
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(i) The horizontal derivative of at ^ as: 

:= D^u(r]). 

(ii) The first-order vertical derivative of at as: 

:= D^u(t]). 



(hi) The second-order vertical derivative of at yy as: 






Definition 12 We say that : C([— T, 0]) ^ Mis (past x present) if admits 

a (necessarily unique) extension u : ^([—T, 0]) ^ M of class "^^’^(past x present). 

Theorem 2 Let ^ : C([— T, 0]) ^ he of class C^’^ipast x present) and 
X = (Xt)te[0,T] he a real continuous finite quadratic variation process. Then, the 
following functional ltd formula holds, ^-a.s.. 



^(X,) = ‘^(Xo)+ f D^‘^(Xs)ds+ [‘ D^‘^(Xs)d-Xs + - f 
Jo Jo 2 Jo 

for alio < t < T, where the window process X was defined in (2). 

Proof Fix t e [0, F] and consider the quantity 



^(Xfid[Xh, 

(19) 



Io(e, t) 



f 



^(Xs+s)-^(^s) ^^ 

S 



1 1 r 

- ^ i^s)ds / ^ i^s)ds, £ > 0. 

^ Jt £ Jo 



Since the process (Xs))s>0 is continuous, Io(s, t) converges ucp to ^ (X^) — 
^ (Xq), namely supQ<^< 7 ^ |fo(^^ 0 ~ Q^t) ~ ^ (^o))l converges to zero in prob- 

ability when £ ^ 0+ . On the other hand, we can write /q (e, t) in terms of the function 
u, defined in (13), as follows 



/o(^,0 = [ 

Jo 



^ ii(Xs^s\[-T,0[, Xs+s) - iiQ^s\[-T,0[, Xs) 



Now we split Io(s,t) into the sum of two terms 



ds. 



_ r ^(^s+s\[-T,0b ^s+s) ii(Xs\[-T,0[, Xs^s) 

^ ’ 7o ^ 



( 20 ) 



h(s,t) 



i 



^ ^Q^s\[-T,0[, Xs^s) - ^Q^s\[-T,0[, Xs) 



ds. 



£ 



( 21 ) 
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We begin proving that 

/i(e,0 [ D^^(X,)ds. (22) 

£^0+ Jo 

Firstly, fix y e"^{[—T,0[) and define 

(p(s,a) := u(y(- — s),a), (£, < 2 ) G [0, oo[xM. 

Then, denoting by 9+ 0 the right partial derivative of 0 with respect to s and using 
formula (17), we find 



0(e + r,^i) -0(e,a) 

dj^(p{£,a)= hm 



0+ r 

u(y(' — s), a) — u(y(' — s — r), a) 

= — hm 

r^o+ r 

= —D^u(y(‘ — £),a), V (£, a) G [0, oo[xM. 

Since u G "^^’^(past x present), we see from Definition lO(ii), that 9+ 0 is continuous 
on [0, oo[xIR. It follows from a standard differential calculus result (see for example 
Corollary 1.2, Chap. 2, in [32]) that 0 is continuously differentiable on [0, oo[x]R 
with respect to its first argument. Then, for every (s, a) G [0, 00 [xR, from the 
fundamental theorem of calculus, we have 



0(e, (2) — 0(0 

which in terms of u reads 

u(y(J,a) - fi(}/(- - s) 



,a) = r 

Jo 



ds(j)(r, a)dr, 



,a) = r 

Jo 



D^u(y(- — r), a)dr. 



(23) 



Now, we rewrite, by means of a shift in time, the term I\ (s, t) in (20) as follows: 

_ r ^(^s\[-T,0[^ Xs) - ^(X^-g|[-r,0[, 

^ ’ 7o £ 



-I 



fi(X^|[-7-^0[, Xs) - u(Ks-£\[-T,0[^ 

£ 

u(Xs\[-T,0[’ ^s) - iiQ^s-s\[-T,0[^ ^s) 



ds. 



(24) 



Plugging (23) into (24), setting y = X^, <2 = we obtain 
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/i(e,0 = 



-i: 



D^u{Xs-r\[-T, Ob Xs)dr^ds 
D^u(Xs-r\[-T, Ob Xs)dr^ds 



e \Jo 

lo D^u(Xs-r\[-T, 0 [^ ^s)dr^ds 



(25) 



Observe that 

D^u(Ks-r\[-T, 0 [’ Xs)dr^ds J D^u(Ks)ds. 

Similarly, we see that the other two terms in (25) converge ucp to zero. As a conse- 
quence, we get (22). 

Regarding l 2 (s, t) in (21), it can be written, by means of the following standard 
Taylor’s expansion for a function / g C^(M): 




fib) = fia) + f’ia){b - a) + -f"ia){b - a)^ 



+ 



[ (1 - a){f'(a + a(b - a)) - f''ia))(b - a)^da, 

Jo 



as the sum of the following three terms: 

f2l(^, 0 = 

t) = 



,t)= f 9„m(X,|[_t,o[, ^' ds 

Jo ^ 



1 /■' 9 - - Xs)^ 

- dlu(K,^i-T, Ob xy-^ ^-^ds 

^ Jo 6 



h'ii£,t)= (1 - «)(9aflM(X5|[-r,0[. 



X,+a(X,+e-X,)) 



^aa^0^s\l-TM’ Xs)) 



(X,+, - Z ,)2 



dot \ds. 



By similar arguments as in Proposition 1.2 of [39], we have 



/22(e, t) \ f 9iM(X,|[_r,o[, X,)d[X], ^ \ f u{Xs)d[X)s. 
e^0+ 2 Jo 2 Jo 

Regarding / 23 (£, t), for every co e 22, define • [0, T] x [0, 1] x [0, 1] ^ M as 
ifco(s,ct,£) := (I -a)dj^u{Xs\[-T, 0 [(^)^^s(co) +Oi(Xs+e(co) - Xs(co))), 
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for all (s, a, s) e [0, T] x [0, 1] x [0, 1]. Notice that \l/oj is uniformly continuous. 
Denote its continuity modulus, then 

sup |/23(e, 01 < [ 
re[0,r] Jo e 

Since X has finite quadratic variation, we deduce that hsis, t) ^ 0 ucp as £ ^ 0+. 
Finally, because of 7o(e, t), Ii(s, t), hii^, t), and / 23 (£, t) converge ucp, it follows 
that the forward integral exists: 

/2i(e,0 r 9«m(X,|[_t,0[, = f D^u{X,)d-X„ 

£->0+ Jo Jo 

from which the claim follows. 



Remark 6 Notice that, under the hypotheses of Theorem 2, the forward integral 
foD^ ^ (X^) J Xs exists as a ucp limit, which is generally not required. □ 

Remark 7 The definition of horizontal derivative. Notice that our definition of hor- 
izontal derivative differs from that introduced in [17], since it is based on a limit on 
the left, while the definition proposed in [17] would conduct to the formula 






lim 

s^0+ 



-h g)l[-r,Q[, y?(Q)) - fi(y?(01[-r,Q[, y?(Q)) 

£ 



(26) 



To give an insight into the difference between (16) and (26), let us consider 
a real continuous finite quadratic variation process X with associated window 
process X. Then, in the definition (26) of D^’+w(X^) we consider the incre- 
ment u(Kt\[-T,0[(' + ^)’ — i^Q^t\[-T,0[’ 2f^), comparing the present value of 

u(Kt) = i^(Xr|[-r,0[, ^t) with an hypothetical future value u(Kt\[-T,0[{’ + X^), 

obtained assuming a constant time evolution for X. On the other hand, in our defi- 
nition (16) we consider the increment u(Xt\[-T,o[, X^) — fi(X^_g|[_7 o[. X^), where 
only the present and past values of X are taken into account, and where we also 
extend in a constant way the trajectory of X before time 0. In particular, unlike (26), 
since we do not call in the future in our formula (16), we do not have to specify 
a future time evolution for X, but only a past evolution before time 0. This differ- 
ence between (16) and (26) is crucial for the proof of the functional Ito’s formula. 
In particular, the adoption of (26) as definition for the horizontal derivative would 
require an additional regularity condition on u in order to prove an Ito formula for 
the process t \-> w(X^). Indeed, as it can be seen from the proof of Theorem 2, to 
prove Ito’s formula we are led to consider the term 

^ u(Xs+s\[-T,0[^ X^+g) - u(Xs\[-T,0[, Xs+s) 



h(s,t) = [ 
Jo 



£ 
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When adopting definition (26) it is convenient to write I\ (e, t) as the sum of the two 
integrals 



7ii(e, 0 



0 



/' 

/' 



^(^^+g|[-r,0[, u(Ks\[-t,0[(' + g), Xs^s) 

£ 

u(Xs\[-T, 0 [(- + g), Xs^s) ^(^^|[-r,0[, 



It can be shown quite easily that, under suitable regularity conditions on u (more 
precisely, if u is continuous, exists everywhere on "^([— T, 0]), and for every 

y G ^([—T, 0[) the map (s, a) \ — ^ Z)^’+fi(}/(- -\-s),a) is continuous on [0, oo) x 
M[, we have 

Il2(s,t) — ^ 

^^0+ Jo 



To conclude the proof of Ito’s formula along the same lines as in Theorem 2, we 
should prove 

7ii(e,0 ^ 0. (27) 

s^0+ 



In order to guarantee (27), we need to impose some additional regularity condition 
on u, and hence on i/. As an example, (27) is satisfied if we assume the following 
condition on u : there exists a constant C > 0 such that, for every e > 0, 



\u(yi,a) - u{y 2 ,a)\ < Cs sup \yi(x) - Y2(x)\, 

X€[— £,0[ 



for all yi, 72 c "^([— T, 0[) and a eR, with yi(x) = y 2 (x) for any x e [-T, —s]. 
This last condition is verified if, for example, u is uniformly Lipschitz continuous 
with respect to the ([— T, 0])-norm on "^([— T, 0[), namely: there exists a constant 

C > 0 such that 

\u(yi,a) - u(y 2 ,a)\ <C \yi(x) - y 2 (x)\dx, 

7[-r,0[ 

for all 7i, 72 c ^([—T, 0[) and a eR. □ 

We conclude this subsection providing the functional Ito’s formula for a map 
^ : [0, T] X C ([— r, 0]) ^ R depending also on the time variable. Firstly, we notice 
that for a map ^ : [0, T] x C([-T, 0]) ^ M (resp. u : [0, T] x ^([-T, 0]) ^ R) 
the functional derivatives and (resp. D^u, D^u, and D^^u) 

are defined in an obvious way as in Definition 1 1 (resp. Definition 9). Moreover, given 
u : [0, T] X ^([—T, 0]) ^ M we can define, as in Definition 8, a map u : [0, T] x 
^([—T, 0[) X M ^ R. Then, we can give the following definitions. 
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Definition 13 Let 1 be [0, T[ or [0, T]. We say that u: I x 0]) ^ M is of 

class X past) x present) if the properties below hold. 

(i) u is continuous; 

(ii) dtu exists everywhere on / x and is continuous; 

(hi) D^u exists everywhere on I x^([—T,0]) and for every y g "^([— L, 0[) the 
map 



(t, s, a) I — ^ D^u(t, y(- — s), a), (t, s,a) e I x [0, oo[xIR 

is continuous on / x [0, oo[xM; 

(iv) D^u and exist everywhere on I x ^([—T,0]) and are continuous. 

Definition 14 Let I be [0, T[ or [0, T]. We say that ^ \ 1 x C([— L, 0]) ^ M is 
C^’^((/ X past) X present) if ^ admits a (necessarily unique) extension u: I x 
^([—T, 0]) ^ M of class "^^’^((/ x past) x present). 

We can now state the functional Ito’s formula, whose proof is not reported, since 
it can be done along the same lines as Theorem 2. 

Theorem 3 Let ^ : [0, T] x C([—T, 0]) Rbe of class C^’^(([0, T] x past) x 
present) and X = (Xt)te[0,T] a real continuous finite quadratic variation process. 

Then, the following functional ltd formula holds, F-a.s., 

^(t,Xt)= ^(0,Xo)+ f (dt^(s,Xs) + D^^(s,Xs))ds-\- f D^^{s,Xs)d~Xs 
Jo Jo 

+ l h D'^'^^(s,Xs)d[X]s, (28) 

2 Jo 

for alio < t < T. 

Remark 8 Notice that, as a particular case, choosing ^{t,r]) = F(t, q(0)), for any 
(t, q) G [0, T]xC([—T, 0]),with/^ G C^’^([0, T] xM), we retrieve the classical Ito 
formula for finite quadratic variation processes, i.e. (4). More precisely, in this case 
^ admits as unique continuous extension the map u: [0, T] x"^([— T, 0]) ^ M 
given by u(t,q) = F(t, q(0)), for all (t, q) G [0, T] x ^([—T, 0]). Moreover, we 
see that = 0, while ^ = 9;^^ and ^ , where 9;^^ (resp. 

9^^ F) denotes the first-order (resp. second-order) partial derivative of F with respect 
to its second argument. □ 



2.4 Comparison with Banach Space Valued Calculus 
via Regularization 



In the present subsection our aim is to make a link between functional Ito calculus, as 
derived in this paper, and Banach space valued stochastic calculus via regularization 
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for window processes, which has been conceived in [13], see also [12, 14-16] for 
more recent developments. More precisely, our purpose is to identify the building 
blocks of our functional Ito’s formula (19) with the terms appearing in the Ito formula 
derived in Theorem 6.3 and Sect. 7.2 in [12]. While it is expected that the vertical 
derivative ^ can be identified with the term ^ of the Frechet derivative, it is 
more difficult to guess to which terms the horizontal derivative corresponds. 

To clarify this latter point, in this subsection we derive two formulae which express 
m terms of Frechet derivatives of ^ . 

Let us introduce some useful notations. We denote by BV([—T,0]) the set of 
cadlag bounded variation functions on [— T, 0], which is a Banach space when 
equipped with the norm 



ll^ll5V([-r,0]) •= l^(0)| + ||^||var([-r,0]), T] e BV([-T,0]), 

where ||^||var([-r,oi) = 0]) and \dr] \ is the total variation measure associ- 
ated to the measure dr] e 0]) generated hy r]\ dr] (] — T, —t]) = r](—t) — 

r](—T), t e [-T, 0]. We recall from Sect. 2.1 that we extend r] e BV{[—T, 0]) to 
all V G M setting ? 7 (v) = 0, v < — T, and r]{x) = r](0), v > 0. Let us now introduce 
some useful facts about tensor products of Banach spaces. 

Definition 15 Let (E, || • ||^) and (F, || • \\f) be two Banach spaces. 

(i) We shall denote hy E (S> F the algebraic tensor product of E and F, defined as 

the set of elements of the form v = 2^=1 ® some positive integer n, where 

e e E and f e F. The map (g): ExF^E<S>F is bilinear. 

(ii) We endow E <S> F with the projective norm tt : 



7t(v) := inf 



n 

'y. ikiiiEii/iiiF 

i=l 



n 

V = ye,- (g) fi 
i=l 



y V e E <S> F. 



(iii) We denote hy E(^jj^F the Banach space obtained as the completion of (g) F for 
the norm tt . We shall refer to E<^j^F as the tensor product of the Banach spaces 
E and F. 

(iv) If E and E are Hilbert spaces, we denote Ef^^F the Hilbert tensor product, 
which is still a Hilbert space obtained as the completion of E(S>F for the scalar product 
{e^ (g) f, (g) f") := {e\ e")E{f, Df, for any g E and f\ f" G F. 

(v) The symbols E^^ and ^(g) denote, respectively, the Banach space E^j^E and 
the element e e of the algebraic tensor product E (S) E. 

Remark 9 (i) The projective norm tt belongs to the class of the so-called reasonable 
crossnorms a on E F, verifying a(e f) = Ik II ^11/ II f- 

(ii) We notice, proceeding for example as in [16] (see, in particular, formula (2.1) in 
[16] ; for more information on this subject we refer to [41]), that the dual (E<^jj^ F)* of 
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E(^y^F is isomorphic to the space of continuous bilinear forms (E, E), equipped 
with the norm || • \\e,f defined as 

\\ 0 \\ e,f := sup \0(e,f)\, W0em{E,F). 

eeE,feF 

\\e\\E,\\f\\F<l 

Moreover, there exists a canonical isomorphism between ^i(E, E) and L(E, F*), 
the space of bounded linear operators from E into E*. Hence, we have the following 
chain of canonical identifications: {E^j^EY = ^i(E, E) = L(E; E*). □ 

Definition 16 Let £" be a Banach space. We say that ^ ^ M is of class C^(E) 

if 

(i) D^, the first Frechet derivative of belongs to C(E; E*) and 

(ii) , the second Frechet derivative of ^ , belongs to C{E\ L(E; £’*)). 

Remark 10 Take £' = C([— T, 0])in Definition 16. 

(i) Eirst Erechet derivative . We have 

D^:C([-r,o]) ^ (C([-r,o]))* = ^([-r,o]). 

For every rj e C([— T, 0]), we shall denote Ddx ^ the unique measure in 
^([— r, 0]) such that 

= [ (p(x)Ddx^(r]), V^GC([-r,0]). 

J[-T,0] 

Notice that ^([—T,0]) can be represented as the direct sum: ^([—T,0]) = 
O])0^o, where we recall that ^o([— 7", 0]) is the subset of ^([—T, 0])of 
measures /x such that /x({0}) = 0, instead (which is a shorthand for ^o([—T , 0])) 
denotes the one-dimensional space of measures which are multiples of the Dirac mea- 
sure 8q. For every r] e C([—T, 0]) we denote by (rj), unique 

pair in 0]) 0 such that 

Ddx^i^) = 

(ii) Second Erechet derivative . We have 

c([-r,o]) ^ L(C([-r,o]); (C([-r,o]))*) = ^/(C([-r,o]),c([-r,o])) 

= {C{[-T, 0f)®y,C{[-T, 0]))*, 

where we used the identifications of Remark 9(ii). Let rj e C([—T, 0]); a typical 
situation arises when there exists Ddx dy^ (h) ^ ^([—T, 0]^) such that (r]) e 
L(C([—T, 0]); (C([— r, 0]))*) admits the representation 
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D'^-W(r])((p,il/) = [ (p{x)\lriy)Ddxdy^{ij), 'i (p, il/ G C([-T,0]). 
7[-r,0]2 

Moreover, D^xdy ^ (^) is uniquely determined. □ 

The definition below was given in [13]. 

Definition 17 Let £" be a Banach space. A Banach subspace (x , INI x ) continuously 
^ 2 

injected into (£’(g)^)*, i.e., || • |lx > II • will be called a Chi-subspace (of 

Remark 11 Take E = C([— T, 0]) in Definition 17. As indicated in [13], a typi- 
cal example of Chi-subspace of C([— T, 0])(g)^ is ^([— T, 0]^) equipped with the 
usual total variation norm, denoted by || • ||var- Another important Chi-subspace of 

C([— r, 0])(g)^ is the following, which is also a Chi-subspace of ^([— T, 0]^): 

XO •= {m ^ 0]^) : ii{dx, dy) = gi(x, y)dxdy -h Xi 8 o{dx) <S> 8 o{dy) 

-h giMdx (8) X 28 o(dy) -h X 28 o(dx) (g) g^{y)dy -h gA,{x) 8 y{dx) (g) dy, 
gl G 0 ] 2 ), g 2 , g 3 € L^([-T, 0 ]), g 4 € 0 ]), Aj, Aj, A 3 € R}. 



Using the notations of Example 3.4 and Remark 3.5 in [16], to which we refer for 
more details on this subject, we notice that xo is indeed given by the direct sum xo = 
L^{[-T, 0]2) © (L2([-r, 0]m%) © {%®hL^a-T,0])) © Of) © 

Diag([—T, 0]^). In the sequel, we shall refer to the term g4(x)8y(dx) (g) dy as the 
diagonal component and to g4(x) as the diagonal element of /x. □ 

We can now state our first representation result for 

Proposition 7 Let ^ : C([— T, 0]) ^ be continuously Frechet differentiable. 

Suppose the following. 

(i) For any rj g C([—T, 0]) there exists (rj) e BV([—T, 0]) such that 

= Dff^(r])dx. 

( a ) There exist continuous extensions ( necessarily unique ) 

u : ^([-T, 0]) ^ M, D^^u : ^([-T, 0]) ^ BV([-T, 0]) 

of^ and , respectively. 

Then, for any q e C([—T, 0]), 

= I Df^(r])d+r^(x), ( 29 ) 

7[-r,0] 
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where we recall that the previous deterministic integral has been defined in Sect. 
2.1.1. In particular, the horizontal derivative (rj) and the backward integral 

in (29) exist. 

Proof Let r] g C([— T, 0]), then starting from the left-hand side of (29), using 
the definition of (r]), we are led to consider the following increment for the 

function u : 



u(r]) - u(r]f - g)l[-r,0[ + y?(Q)l{0|) 
£ 



(30) 



We shall expand (30) using a Taylor’s formula. Firstly, notice that, since is 
Frechet on C([—T, 0]), for every rji e C([— T, 0]), with ^i(O) = r](0), from the 
fundamental theorem of calculus we have 

(r]) - (r]i) = j Df'^(r] + X(r]i-r]))(r](x)-r]i(x))dx 




Recalling from Remark 3 the density of (O) ( [ — T , 0] ) in ( 0 ) ( [ — T , 0] ) with respect 

to the topology of "^([—T, 0]), we deduce the following Taylor’s formula for u: 



u{r}) - uirji) = (/t- ^ 



+ X{rj\ — r]))(r]{x) - r]\{x))dx \dX, (31) 



for all rji g "^^( 0 )([— T, 0]). As a matter of fact, for any 5 g]0, T /2] let (similarly to 
Remark 3(i)) 



, . -T <x < -8, 

■” U('n(0)-'?i(-5))^ + '?i(0), -s<x<o 

and rji^o := r]i. Then rji^s g C([— T, 0]), for any 8 g]0, T/2], and rji^s r]\ in 
^([-r, 0]), as 5 ^ 0+. Now, define / : [-T, 0] x [0, 1] x [0, T/2] ^ M as follows 

f(x,X,8) := Dfu(r] + X(r]i^8-r]))(r](x)-r]i^8(x)), 

for all (x, X, 8) G [-T, 0] x [0, 1] x [0, T /2]. Now (k, 8) \-^ r] X(r]i^8 ~ is 
continuous. Taking into account that D^^u : ^([—T, 0]) ^ BY {[—T, 0]) is contin- 
uous, hence bounded on compact sets, it follows that / is bounded. Then, it follows 
from Lebesgue dominated convergence theorem that 



Functional and Banach Space Stochastic Calculi . . . 



53 



ar 

(/-/<*■ 



Df<^ {r] + X(r]i^s ~ r])){r]{x) - r]i^s{x))dx ]dX 



} 



nl / rO 



f(x, X, 0)dx \dX 



)‘ 



from which we deduce (31), since ^ {xii,&) uixii) as 5 ^ 0+. Taking /7i(-) = 
r}{- - e)l[_7’,o[ + ??(0)1{0), we obtain 



u(r]) - u{r]{- - e)l[-r,o[ + ??(0)1{0}) 
s 

= (/ ^ — —dx^dX 

= s) + hin, s) + h{r], s), 



where 



h(r],s):= ^r]{x)^{^Dfu{’n + X{r]{--E)--q{-))\[-Tfi[) 

- Df^^u{r] + X[rj(- - e) - r]{-))l[-T,o[)^dx^dX, 
h(Jl,s)-= ^ {^j r]{x)Dfj^^u{p +X[r){- - e) - r){-))\[-Tfi[)dx^dX, 

Isir], s) := -^ ri(x)Dl\^u{ri + X{r]{- - s) - r]{-))l[-TM)dx^dX. 



Notice that, since r](x) = 0 for x < — T, we see that s) = 0. Moreover 

Dfu(^) = for X > 0, and + -s)- ^(-))l[-r,0[ ^ ^ in ^([-T, 0]) 

as £ ^ 0+. Since is continuous from "^([— T, 0]) into BV([—T, 0]), we have 
D^^u(r] + X(t]{’ - s) - ^(-))l[-r,0[) ^ as e ^ 0+. Then 



1 

- / ri{x)Dfj^^u{r] + X{r^{- - e) -r]{-))\[-Tfi[)dx 
^ J —8 

= -[ r](x)dxD^u{r] + X{r](--s)-r](-))l[-T,0[) r]{G)Dl^u{r^). 

^ J —8 



So hi^l, s) r](0)DQ^u(r]). Finally, concerning Ii(r], s), fromFubini’s theorem we 
obtain (denoting r]e^x := >7 + ^(??(- - e) - »7(-))l[-r,0[) 
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I\{r],s)= ^r]{x)^{^Dfu{r]e,x) - Dfj^^u{r]e,x))d:^dX 

= -/'(/ 1 / r^{x)d^Df u{rje,x)]dX 

Jo \J]-T.s] £ \Ji-T)V(y-s) / / 



= In(ri,e) + h 2 {ri,e). 



where 



In(rt,s):= - f( [ T 

Jo \i]-r,s]e\ 


rOAy \ 

/ T](x)dx I 

J{-T)v{y-8) / 


h 2 (ri,e) := - [U f -( 

Jo \J]-T,s]S\ 


rOAy \ 

/ rj{x)dx 1 

J{-T)v{y-s) ) 



\dX 



1 / /-OAy 



--{UM. 



n-T,s] ^ \J{-T)y{y-s) 

^ac 

^0 



r](x)dx 



'^DfyUiri). 



Recalling that D^u{’) = D^u{-), for x > 0, we see that in hi{r], s) and /i 2 (^, s) 
the integrals on ] — T, e] are equal to the same integrals on ] — T, 0], i.e., 

1 / 



hi{ri,s)= - [^ ( [ -( r 

Jo \J]-T,0] £\J(- 

--I'il -U 

Jo \J]-T,0]£\Jy 



T)V{y-s) 



rf{x)dx 



]-r,0] ^ \Jy-s 
1 / 



Y]{x)dx 



{-T)V{y-s) 



rj{x)dx 



h2{r),8)= - f -( [ 
J]-T,0] £ \Ji- 



'^(DfyUirj.^x) - DfyU(rj))yx, 
)DfyU(r,) 



Now, observe that 



£-> 0 + 



|/ll(^,^)l < Moc\\D^^u(rjs,x) - 0. 

Moreover, since r] is continuous at g] — T, 0], we deduce that r\{x)dxl8 
T](y) as £ ^ 0+. Therefore, by Lebesgue’s dominated convergence theorem, we get 



h 2 (^,s) - f r](y)Df u(r]). 

J]-T,0] 



Functional and Banach Space Stochastic Calculi . . . 



55 



So /i(^, s) ^ Qj r]{y)D^yU{r]). In conclusion, we have 

D^^iri) = ri(0)DQ^u(r]) - [ t]{y)Df u(r]). 

J]-T,0] 

Notice that we can suppose, without loss of generality, (rj) = (t]). 

Then, the above identity gives (29) using the integration by parts formula (12). □ 



For our second representation result of we need the following generaliza- 

tion of the deterministic backward integral when the integrand is a measure. 

Definition 18 Lota < Z? be two reals. Let/: [a,b] M be a cadlag function (resp. 
cadlag function with f(a) = 0) and g e ^([—T, 0]). Suppose that the following limit 




g{ds)d~'~f{s) 




g(ds)d f(s) 



lim [ 

«^ 0 + J[a,b] 

lim [ 

«^ 0 + J[a,b] 



gids) 

g(ds) 



fj(s) - fjjs - e) 
s 

fjjs + e) - fjjs) 
£ 



(32) 

(33) 



exists and it is finite. Then, the obtained quantity is denoted by 

( gd~ f) and called (deterministic, definite) backward (resp. forward) inte- 
gral of g with respect to / (on [a, h]). 

Proposition 8 Ifg is absolutely continuous with density being cadlag {still denoted 
with g) then Definition 18 is compatible with the one in Definition 4. 

Proof Suppose that g{ds) = g{s)ds with g cadlag. 

Identity (32). The right-hand side of (6) gives 



Ja £ 

which is also the right-hand side of (32) in that case. 

Identity (33). The right-hand side of (5) gives, since f{a) = 0, 



1 fj(s -h £) 

-g(a) f(s + s)ds+ g(s)'-d^ ^ 

^ Ja—s Ja ^ 






ds 



The first integral goes to zero. The second one equals the right-hand side of 
(33). □ 
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Proposition 9 Let r] e C([—T, 0]) be such that the quadratic variation on [— T, 0] 
exists. Let ^ : C([— T, 0]) ^ R be twice continuously Frechet differentiable such 
that 

: C([— r, 0]) — ^ xo C (C([— r, 0])(i);rC'([— r, O]))* continuously with respect to xo- 
Let us also suppose the following. 

( i) (q), the diagonal element of the second-order derivative at q, has a 
set of discontinuity which has null measure with respect to [q] (in particular, if 
it is countable). 

(ii) There exist continuous extensions (necessarily unique): 



In particular, the backward integral in (34) exists. 

Proof Let q G C([— L, 0]). Using the definition of we are led to consider 

the following increment for the function u : 



with e > 0. Our aim is to expand (35) using a Taylor’s formula. To this end, since 
^ is Frechet, we begin noting that for every q\ e C([—T, 0]) the following 
standard Taylor’s expansion holds: 



u: ^([-T, 0]) ^ M, 



Dj,^yu:n[-T,0])^ XO 



of^ and respectively. 

(Hi) The horizontal derivative (q) exists at q e C({—T, 0]). 

Then 



= [ Di.^(n)d+n(x) - - f o}^'‘'^‘^(ri)d[r]]{x). (34) 




u(r]) - u{r}{- - e)l[-r,o[ + '?( 0 ) 1 { 0 |) 



(35) 



s 



^ (rji) = ^ {r}) + Ddx'^ {vi){ril(x) - rj(x)) 




+ l f - rj(x)){r]i(y) - rj(y)) 

4 J[-r,0]2 
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Now, using the density of C^( 0 )([— 0]) into "^^( 0 )([— T', 0]) with respect to the 
topology of^([— r, 0]) and proceeding as in the proof of Proposition 7, we deduce 
the following Taylor’s formula for u\ 



u{7]) - - e)l[_7^o[ + ^(0)1{0}) 



-L 



[-T01 



-U 



D 



[-TOP 



dx dy 






Wx) - ri(x - e))(r](y) - r){y - e)) 






(36) 

l[-r,0[x[-r,0[(-^^ y) 
0 [) 



- D 



dx dy 






iv(x) - nix - s))(ri(y) - r){y - e)) 



l[-r,0[x[-r,0[(^> y) 



'jdX. 



Recalling the definition of xo given in Remark 1 1 , we notice that (due to the presence 
of the indicator function 1 [_t’ o[x[-r,0[) 



i 



D 



[-TOP 



dx dy 



(rj) 



(riix) - ri(x - s))(r](y) - r](y - e)) 



l[-r, 0 [x[-r, 0 [(-^- y) 



J[-T,of e 



+ 



i 



D 



2,Diag 



^ (rj) 



- T](x - s)y 



dx, 



[-T01 



where, by hypothesis, the maps r] g ^([—T, 0]) u(r]) e L^([—T, 0]^) and 

T] G ^{[—T, 0]) G L^([—T, 0]) are continuous. In particular, (36) 

becomes 

u(t]) - u(r]y - £)l[-r, 0 [ + ^( 0 ) 1 { 0 }) 



= h(s) + his) + his) + his) + his), 

(37) 



where 



7i(e) ;= / 



rjix) — rjix — s) 



W x('?(-^)-'?(-^ -e))('?(y)- '?(}'- e)). . 

/2(e) := -r / £>;’v dxdy, 

^ J\-J 



/[-TOP 



/ 3 (e) ;= 



-u. 






(r]{x) - r]{x - s)y 



dx. 



[-TO] 
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/4(e) := - [\l-X)( [ (olfuir] + X{r,{- - s) - /7(0)l[-r,O[) 

Jo \/[-r,0]2 '' 

- 0.;f °^(,)) ~ ~ ~ ~ ‘'^ dxdy\iX. 

/j(s);= - f (d;'““*u(i( + «u(. - s)->|(-))l|-r,0[) 

Jo \J[-T,0] ^ 



Firstly, we shall prove that 

/ 2 (e) 0. (38) 

To this end, for every e > 0, we define the operator T^: L^([—T,0]^) Mas 
follows: 



Tsg 



i 



(n(x}-n(x-s))(r]{y)-r]{y-s)) u ^ t2„ m2^ 

g(x,y) dxdy, V g e L^{[-T,0Y). 

[-r,0]2 £ 



Then Tg e L^([—T, 0])*. Indeed, from Cauchy-Schwarz inequality. 



I ^ ,1 ,1 f (r]ix)-r](x-s})^(tj(y)-tj(y-s))^ 

llgllL2([-r,o]2)yy^ ^2 

— llgllL2([_7’^o]2) / dx 

J[-T,0] e 

and the latter quantity is bounded with respect to s since the quadratic variation of 
T] on [— r, 0] exists. In particular, we have proved that for every g e L^([—T,0]^) 
there exists a constant Mg > 0 such that 



sup \Teg\ < Mg. 

0<8<\ 

It follows from Banach- Steinhaus theorem that there exists a constant M > 0 such 
that 



sup \\Ts\\l^{[-T,0])* — 
0<e<l 



Now, let us consider the set := {g € L^([—T, 0]^) : g{x, y) = e(x) f(y), with 
e, / e C^{[—T, 0])}, which is dense in L^([— T, 0]^). Let us show that 



Teg 



e^0+ 



0, 'ig&y. 



(40) 
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Fix g G with g(x, y) = e{x)f(y) for any (x, y) e [-T, 0], then 

Teg = - [ e(x){rj(x)-r](x-s))dx [ f iy){r]{y) - r]{y-s))dy. (41) 

e J[-T,0] J[-TM 



We have 



/ e(x)(^T](x) — T](x — s))dx = / [e(x) — e(x s))r](x)dx 

J[-T,0] J[-T,0] 



I 

J[-T-e- 
s(j \e 



e{x + s)r](x)dx + 



/ . 



e(x + £)r](x)dx 



e(x)\dx + 2||^||c 






OO I II viloo- 



Similarly, 



[ f(y){riiy) - rjiy - s))dy < s( [ + 2||/||oo) ll»7lloo- 

J[-T,0] \J[-Tfi] / 



Therefore, from (41) we find 



\Teg\ < e 



(/ 

\d[-r,o] 



e{x)\dx + 2||e||( 



=)(/ 

/ \ J[-r,0] 



\f{y)\dy + 2\\fl 






OO I IK/lloo’ 



which converges to zero as s goes to zero and therefore (40) is established. This in 
turn implies that 



^g €L\[-T,0f). (42) 

Indeed, fix g g L^([—T,0]^) and let (gn)n C ^ be such that ^ g in 
L^([-T, 0]^).Then 

\Ts g\ < \Tg(g — gn)\ \Tg gn\ < \\Ts\\L2(^^_j^ Q^2^*\\g — gn\\L^([-T,0]^)~^\Tsgn\- 

From (39) it follows that 

\Ts g\ < M\\g — gn\\L2q^_j^ Q^2^ \Ts gn\, 

which implies lim supg^Q+ \Tg g\ < M\\g — g^ II L2([-r,oF) • Sending n to infinity, we 
deduce (42) and finally (38). 

Let us now consider the term (s) in (37). Since the quadratic variation [rj] exists, 
it follows from Portmanteau’s theorem and hypothesis (i) that 
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/ 3 (e) = [ ^ — —dx — ^ 

J[-T,0] S e^0+ J[-T,0] 

Regarding the term /4(e) in (37), let 0^ : [0, 1]^ ^ L^([—T, 0]^) be given by 

(pn{s,X)(-,-) = + Xitji- - s) - r](-))l[-TM)- 

By hypothesis, 0^ is a continuous map, and hence it is uniformly continuous, since 
[0, 1]^ is a compact set. Let denote the continuity modulus of 0^, then 

\\D^:^^u{tj + X(r]i- -s) - /7(0)l[-r,o[) - 

= II077(^» “ 0?7(O» ^)llL2([-r,0]2) — P(/)rj(^)' 

This implies, by Cauchy-Schwarz inequality. 



lo rop u(r] + X(ri(- -e)-r](-))l[_T,oO 



D 



rp^(ri)) 



(t](x) - t](x - e))(r](y) - rj{y - e)) 



dxdy \dX 






rl 2 

< (i -X)\\D^:‘^ u(ri + X(r](- -e) 

< [\l-X)p^ (s)( [ 

Jo ’ \J[-TM 

= Ip^,(s) I 



(r)(x) - rj{x - s))^{rj{y) - r](y - e))2 



[-r,0]2 

(ri(x) - rj{x - e))2 



dxdydX 



‘) 



dx \dX 



(r](x) - r](x -s)y s-^0+ „ 
dx — ^ 0. 



/[-TO] 



Finally, we consider the term /5(e) in (37). Define 0^ : [0, 1]^ ^ L^([— T, 0]) as 
follows: 

Vf^(e, >.)(■) = +;.(??(■ -e) -??(•)) l[-r,0[)- 



We see that 0^ is uniformly continuous. Let denote the continuity modulus of 
0^ , then 

+ X(r,i- - e) - /7(0)l[-r,0[) - 
= IIV^f)(e. A.)||z,oo([_7-_o]) < Pi/f,(e). 
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Therefore, we have 



[ (!->•)(/ + X(r](- - s) - r](-))l[-T,oO 

Jo \J[-T,0] ^ 

^ /<'-«(/ 

Jo \Jl-T, 



(ri(x) - r](x - s)y 






dx \dX 



Pir,(e) 



(r]{x) - r]{x - s)y 



■) 



dx \dX 



1 



I 



{r]{x) - T](x -s)y s-^0+ „ 
dx — ^ 0. 



[-TO] 



In conclusion, we have proved that all the integral terms in the right-hand side of 
(37), unless I\ (s), admit a limit when s goes to zero. Since the left-hand side admits 
a limit, namely ^ deduce that the backward integral 



h(s) 



i 



[-TO] 



(ri) 



r](x) - r](x - s) s->o+ 



i 



[-r,0] 






exists and it is finite, which concludes the proof. 



□ 



3 Strong- Viscosity Solutions to Path-Dependent PDEs 

In the present section we study the semilinear parabolic path-dependent equation 
dt^ + = 0, V(f, ??) € [0, rLxCCL-r.O]), 

^(r, ??) = H(r]), V?? e c([-r,0]). 

(43) 

We refer to = dt‘^ + as the path-dependent heat oper- 

ator. The results of this section are generalized in [9, 10], where more general 
path-dependent equations will be considered. Here we shall impose the following 
assumptions on H and F. 

(A) H \ C([— T, 0]) ^ M and F \ [0, T] x C([— T, 0]) x M x M ^ M are Borel 
measurable functions and satisfy, for some positive constants C and m, 

\F{t, t], y, z) - F{t, r], y', z')\ < C(|)> - y'\ + k - z'\), 

\H{n)\ + \F{t,ri,0,0)\ < C(1 + ||/7C). 

for all (r, r]) e [0, T] x C{{—T, 0]), y, y' e M, and z, z! e M. 



62 



A. Cosso and F. Russo 



3.1 Strict Solutions 



In the present subsection, we provide the definition of strict solution to Eq. (43) and 
we state an existence and uniqueness result. 

Definition 19 A function ^ : [0, T] x C([— T, 0]) ^ M in C^’^(([0, T[xpast) x 
present) H C([0, T] x C([— T, 0])), which solves Eq. (43), is called a strict solution 
to (43). 

We now introduce some additional notations. Let (f2, P) be a complete prob- 
ability space on which a real Brownian motion W = {Wt)t>Q is defined. Let 
F = denote the completion of the natural filtration generated by W . 

• S^(A T), p > \, 0 < t < T , the set of real cadlag F-adapted processes Y = 
(f 5 )?< 5 <r such that 






• T)^, /? > 1, 0 < r < r, the set of -valued predictable processes Z = 

(Zs)t<s<T such that 



\\Z\\P , := E 

mP(t,T)d 




< oo. 



We simply write T) when d = 1. 

• A+’^(a r), 0 < t < r, the set of real nondecreasing predictable processes K = 
(Ks)t<s<T e T) with Kt = 0, so that 






• M"^), p > 1, 0 < t < r, the set of -valued F-predictable processes 

0 = ((ps)t<s<T such that 



II0II 



P 



:= E 




< 00 . 



Definition 20 Let t e [0, T] and r] g C([—T, 0]). Then, we define the stochastic 
flow 



Wj’^(x) 



T](x s — t), —T < X < t — s, 

r](0) + Wjc+s -Wt, t - s < X <0, 



for any t < s < T. 
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Theorem 4 Suppose that Assumption (A) holds. Let ^ : [0, r]xC([— T, 0]) ^ M 
he a strict solution to Eq. (43), satisfying the polynomial growth condition 

< c(l + ||??||™), V(f,/ 7 )e[ 0 ,r]xC([-r, 0 ]). (44) 



for some positive constants C and m. Then, we have 

rj) = Yp, V (r, ri) e [0, T] x C{[-T, 0]), 

where (sM’"'), D'' ^ (sM''')hr,T[(s))se[t,T] e 

(t, T) X H^(r, T) is the solution to the backward stochastic differential equation: 
¥-a.s., 



F/’'' = H(W^y) + / F{r,W‘ff, Yp, Z*;^)dr - / Z['^dWr, t <s <T. 
J S J S 

In particular, there exists at most one strict solution to Eq. (43). 

Proof Fix (t, q) G [0, T[xC([— T, 0]) and set, for dX\t < s <T , 

Yl’^ = Z’;^ = £>'"‘^(5, Wj’")l[r,r[(^). 

Then, for any Tq e [t, T[, applying Ito formula (28) to ‘W (^, Wj’ *^) and using the 
fact that ^ solves Eq. (43), we find, P-a.s., 

rTo pTq 

Y^’^ = F^;''+ / F{r, Yp, Z[’'^)dr- / Z^’^^dWr, t <s <Tq. (45) 

J s J s 

The claim would follow if we could pass to the limit in (45) as Tfi ^ T . To do this, we 
notice that it follows from Proposition B. 1 in [10] that there exists a positive constant 
c, depending only on T and the constants C and m appearing in the statement of the 
present Theorem 4, such that 



pTo pT 

EJ^ < c||F^’"|||2(^_y)+cEy^ |F(r,W;:''',0,0)|2dr, V7b€[f,T[. 

We recall that, for any q > I, 



E 




< 00 . 



(46) 



Notice that from (44) and (46) we have ||T^’^|ls2(r,r) < c>o, so that Y e S^(t, T). 
Then, from the monotone convergence theorem we find 
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nT pT 

E \F{rMr'\0,Q)\^dr. 

Therefore, it follows from the polynomial growth condition of F and (46) that 
Z G T). This implies, using the Lipschitz character of F in (y, z), that 

E \F(r, Wr^ , t/’^, Zr^)\^dr < oo, so that we can pass to the limit in (45) and 
we get the claim. □ 

We conclude this subsection with an existence result for the path-dependent heat 
equation, namely for Eq. (43) with F = 0, for which we provide an ad hoc proof. 
For more general cases we refer to [9]. 

Theorem 5 Suppose that Assumption (A) holds. Let F = 0 and H be given by, for 
all q G C([—T, 0]), (the deterministic integrals are defined according to Definition 

4(i)) 



H{q) 






[-r,0] 



(p\{x + T)d r](x) 



■L 



[-TO] 



(pN(x-\-T)d q(x)], (47) 



where 

• h belongs C^(M^) and its second order partial derivatives satisfy a polynomial 
growth condition, 

• q>u...,cpN & C2([0, T]). 

Then, there exists a unique strict solution ^ to the path-dependent heat Eq. (43), 
which is given by 

rj) = E[//(W^y)], V (?, rj) e [0, T] x C{[-T, 0]). 



Proof Let us consider the function ^ : [0, T] x C([— T, 0]) ^ M given by, for all 
(t,q)e[0, T]xC([-T, 0]), 



^ (t, q) = E[i7(W^^)] 



= E|/z 
= F 



(t 

('I 



(pl(x+t)d r](x) + / (pi(s)dW. 






(pl(x + t)d r](x),..., (pN(x + t)d r](x) 



)■ 



where 

F(t, x\, . . . , xn) 



— E /z I j 



T pT 

(pi(s)dWs, ...,xn A- / (pN(s)dWs 
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for any , xn) c [0, T] x . Notice that, for any /, 7 = 1, . . . , 



Dxf x\, . . . , Xn) = E 
Dl.^.^{t,xi,...,XN) = E 




(pi(s)dWs,...,XN -\- 
T 

(p\{s)dWs, ...,XN 



j (pN{s)dW, 



)]■ 



<PN(s)dWi 



)]■ 



SO that ^ and its first and second spatial derivatives are continuous on [0, T] x 
Let us focus on the time derivative of We have, for any 5 > 0 such that 
t + 8 e[0, T], 



8, x\, . . . , Xn) — ^(t,xi, . . . , Xn) 



= 



Xi+ (pi(s)dWs 

Jt+s 






Then, using a standard Taylor formula, we find 



8, xi, ..., Xn) — xi, . . . , Xn) 



8 



= — E 
5 



rl ^ ^ / rT 

7 7 (p\{s)dWs-a ipi{s)dWs, 



(48) 




(Pi(s)dWsda . 



Now, it follows from the integration by parts formula of Malliavin calculus, see, e.g., 
formula (1.42) in [31] (taking into account that Ito integrals are Skorohod integrals), 
that, for any i = 1, . . . , N , 



E 



"^Xih^xi <^i(^)(l - al[tj+8]{s))dWs, . . (Pi(s)dWi 

(1 - a)E^^^Dl.^.h^xi + (pi(s){^ - al[t,t+8](s))dWs, . . (pi(s)(pj(s)ds 



(49) 



Then, plugging (49) into (48) and letting 8 0+, we get (recalling that D^.^ .h has 

polynomial growth, for any i, j) 



x\, ... , Xn) 



--E 

2 



r N 

z 

67=1 



Dt 



(fl(s)dWs, . . 






(50) 

for any (t, x\, . . . , xn) c [ 0, r[xM^, where 9 ^”^ ^ denotes the right-time derivative 
of ^ . Since ^ and ^ are continuous, we deduce that dt^ exists and is continuous 
on [0, T[ (see for example Corollary 1.2, Chap. 2, in [32]). Moreover, from the 
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representation formula (50) we see that dt^ exists and is continuous up to time T . 
Furthermore, from the expression of we see that 

1 ^ 

= -- Viit)(Pj{t)Dl.,^,'P{t,X\, . . .,Xn). 

i,i=\ 



Therefore, ^ e Ci’2([0, T] X M^) and is a classical solution to the Cauchy problem 

X) + i 2u=i Vi(t)Vjit)Dl,^^(t, X) = 0, V(r, X) e [0, T[xR^, 
^(T,x) = h(x), Vx G 

(51) 

Now we express the derivatives of ^ in terms of We begin noting that, taking 
into account Proposition 4, we have 

[ (pi(x -\-t)d~T](x) = T](0)(pi(t)- [ T](x)(pi(x -\-t)dx, W T] e C([-T,0]). 

J[-t,0] J-t 

This in turn implies that ^ is continuous with respect to the topology of "^([— T, 0]) . 
Therefore, ^ admits a unique extension u : "^([— T, 0]) ^ M, which is given by 



u(t, rf) = 



V 



(pi(x + t)d r](x),. 



n-t,o] 



(pN(x-\-t)d r](x) 



)■ 



forall(A^) G [0, T] x"^([—r, 0]). We also define the map w : [0, T] x"^([— T, 0[) x 
R ^ M as in (13): 

u(t,y,a) = u(t,yl[-T,0[-\-al{0}) = ^ , acptit) - J y{x)(pi{x-\-t)dx,...^, 

for all (t, y, a) G [0, T] x ^{[—T, 0[) x R. Let us evaluate the time derivative 
dt^ (t, rj), for a given (L rf) g [0, T[xC([— T, 0]): 



dt^ {t, rf) = dt^ 
N 



(pi{x-\-t)d T]{x),..., (pN(x-\-t)d T]{x)] 

-f,0] */[-?, 0] ) 



('■/ 

V h-t,^ 

I 



<Pi(x + t)d r](x) 



■■■■ML 



(Pi(x + t)d T]{x) 



) 



Notice that 



9? 



(/ ■ 



(pi{x + t)d r](x) 






rj(x)(pi(x + t)dx 



r](0)(p(t) - r](-t)(pi(0'^) - / r](x)^i(x -\-t)dx. 
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Let us proceed with the horizontal derivative. We have 









H; 



= lim 

£-^ 0 + 



u{t, ??|[-r,0[(-), »7(0)) - M(f, »7|[-r,0[(- - e), »7(0)) 



- , r]{G)(pi{t) - J 



T](x)(pi(x + t)dx 



Tfj{x — s)(pi{x + t)dx, 



) 

)) 



From the fundamental theorem of calculus, we obtain 






■/: 



r]{Qi)(Pi{t) - / Y](x)(pi(x + t)dx, 



) 



T](0)(Pi(t)- / T](x — s)(pi(x + t)dx. 



L , r]{0)ipi(t) - J 

/: 



) 



/ 



ri(0)(pi(t)- T](x - y)<pi(x + t)dx. 



H 



ri(0)<pi(t) 



-J t](x - y)(pi(x + t)dx'^dy. 
Notice that 



- j 1 



dy\rj(0)<pi(t) - I r](x - y)<pi(x + t)dx ) = - 

r-y 



)-<o 



T](x)<Pi(x + )> + t)dx 



= -^tl(-y)<Pi(t) - fli-t - y)<Pi(0'^) + j r](x)^i(x + y + t)dx'^. 



Therefore 

r]) 



1 ^ 

= - lim - / 'yD^i'P\t,...,r](Q)(pi{t)- / 

e^0+ S Jo \ J-t 



rj{x — y)tpi{x + t)dx. 






y)<Pi(t) 



- rii-t - y)<pi 



ft(0+) + J 



r\{x)lpi{x + J + t)dxjdy 



= - J v(x)4>i(x + t)dx, . . 



/: 



— / rj{x)^i{x t)dx 



) 
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Finally, concerning the vertical derivative we have 

= D'^u{t,r])= ??l[_7-,o[ + »?(0)1{0)) 

= / (p\{x -\-t)d~ri{x), 

7^, V J[-tM / 



and 



= D''''u{t,n)= d^^i(t,rili-T.O[ + ri(On{0}} 



ij = l ^ 



(pi(x-\-t)d r]{x), ... ](pi{t)(pj{t). 



From the regularity of ^ it follows that ^ G C^’^(([0,r] x past) x present). 
Moreover, since ^ satisfies the Cauchy problem (51), we conclude that dt^(t,r]) + 
+ = 0,ior&\l {t,rf} e [0, 7’[xC([-7’, 0]), therefore 

is a classical solution to the path-dependent heat Eq. (43). 



3.2 Towards a Weaker Notion of Solution: A Significant 
Hedging Example 



In the present subsection, we consider Eq. (43) in the case F = Q. This situa- 
tion is particularly interesting, since it arises, for example, in hedging problems of 
path-dependent contingent claims. More precisely, consider a real continuous finite 
quadratic variation process A on (f2, P) and denote X the window process asso- 
ciated to X. Let us assume that [X]t = t, for any t e [0, T]. The hedging problem 
that we have in mind is the following: given a contingent claim’s payoff H(Kt), is 
it possible to have 

H(Xt) = Ho+ [ Ztd~Xt, (52) 

Jo 

for some Hq e R and some F-adapted process Z = (Zt)te[0,T] such that = 
v(L with V : [0, T] x C([— T, 0]) ^ M? When X is a Brownian motion W and 
\Zf\^dt < 00 , F-a.s., the previous forward integral is an Ito integral. If H is 
regular enough and it is cylindrical in the sense of (47), we know from Theorem 5 
that there exists a unique classical solution"^ : [0, T]xC([— T, 0]) ^ MtoEq. (43). 
Then, we see from Ito’s formula (28) that ^ satisfies, F-a.s., 

= ‘^(0,Xo)+ / D'^‘^{s,^s)d~Xs, 0<t<T. (53) 

Jo 

In particular, (52) holds with Z^ = D^'^(L X^),forany r g [0, T], Hq = ^ (0, X^). 
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However, a significant hedging example is the lookback-type payoff 
H(t]) = sup T](x), Vt] eC([-T,0]). 

xe[-T,0] 

We look again for ^ : [0, T] x C([—T, 0]) ^ M which verifies (53), at least for X 
being a Brownian motion W. Since ^ (t, Wt) has to be a martingale, a candidate for 
is (t, T]) = E[^(W^^)1, for all (t, r]) e [0, T] x C([—T, 0]). However, this 
latter ^ can be shown not to be regular enough in order to be a classical solution 
to Eq. (43), even if it is “virtually” a solution to the path-dependent semilinear Kol- 
mogorov equation (43). This will lead us to introduce a weaker notion of solution 
to Eq. (43). To characterize the map we notice that it admits the probabilistic 
representation formula, for all (t, r]) e [0, T] x C([— T, 0]), 

^{t,T])= E[F^(W'^^)] sup W'^^(x)l 

’--r<jc<o -I 

= Er( sup rj(x)) V ( sup (W;c ~ Wt + ^(0)))1 

‘-^-t<x<0 ' ^t<x<T 

where the function / : [0,r]xMx]R^Ris given by 

/(C m, x) — E[m V (Sr-t + -^)], V (t, m, x) c [0, T] x M x M, (54) 

with St = supQ<^<^ Ws, for all t e [0, T]. Recalling Remark 3, it follows from the 
presence of sup_^<^<Q r](x) among the arguments of /, that ^ is not continuous 
with respect to the topology of ^ ([— T, 0]), therefore it can not be a classical solution 
to Eq. (43). However, we notice that sup_^<^<Q r](x) is Lipschitz on (C ([— T, 0]), || • 

I loo), therefore it will follow from Theorem 7 that is a strong-viscosity solution 
to Eq. (43) in the sense of Definition 21. Nevertheless, in this particular case, even 
if ^ is not a classical solution, we shall prove that it is associated to the classical 
solution of a certain finite dimensional PDE. To this end, we begin computing an 
explicit form for /, for which it is useful to recall the following standard result. 

Lemma 1 (Reflection principle) For every a > 0 and t > 0, 

E(^r >^) = n\Bt\ >a). 

In particular, for each t, the random variables St and \ Bt \ have the same law, whose 
density is given by: 



fit, sup 

^ -t<x<0 ' 






e l[0,oo[(z)> 



Vz € M. 
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Proof See Proposition 3.7, Chapter III, in [35]. 



□ 



From Lemma 1 it follows that, for dX\ (t , m , x) G [0, L[xR x M, 

2 / z \ 

f(t,m,x) = / mv{z-\-x)(pT-tiz)dz = / mv(z-\-x) ^\ / 

Jo Jo — — 

where (p{z) = tx^{z^ jl) j , z G M, is the standard Gaussian density. 

Lemma 2 The function f defined in (54) is given by, for all (t,m,x) G [0, r[xM x 






2{T — t) 

2 ( 7-0 . 



for X < m, and 



f(t, x,m) — X + 



2{T - t) 



for X > m, where ^{y) = cp(z)dz, y ^ R, is the standard Gaussian cumulative 
distribution function. 

Proof First case: x <m. We have 

nm—x 2 / Z \ 700 2 / Z \ 

/(,,„.,) . I (55) 

The first integral on the right-hand side of (55) becomes 

I' 

where 0 (y) = (p(z)dz, y G R, is the standard Gaussian cumulative distribution 
function. Concerning the second integral in (55), we have 

POO 2 / z \ 

<piz)dz 

VT^ y/T^t 

2{T — t) _(m^xF ( 
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Second case: x > m. We have 







oo roo 

z(p{z)dz -\-2x (p(z)dz = 
Jo 



2{T -t) 



7t 



+ X. 



We also have the following regularity result regarding the function /. 

Lemma 3 The function f defined in (54) is continuous on [0, T] x M x M, moreover 
it is once (resp. twice) continuously differentiable in (t, m) (resp. in x) on [0, T[x Q, 
where Q is the closure of the set Q := {(m, v) g M x M: m > x]. In addition, the 
following ltd formula holds: 

fit, S,, B,) = f(0, 0, 0)+j‘ (d,f(s, Ss, Bs) + Ss, Bs))ds (56) 

+ [ dmf(s, Ss, Bs)dSs + [ dxf(s, Sg, Bs)dBs, ^ <t <T, 

Jo Jo 

Proof The regularity properties of / are deduced from its explicit form derived in 
Lemma 2, after straightforward calculations. Concerning Ito’s formula (56), the proof 
can be done along the same lines as the standard Ito formula. We simply notice that, 
in the present case, only the restriction of / to 2 is smooth. However, the process 
{{St, Bt))t is 2 -valued. It is well-known that if Q would be an open set, then Ito’s 
formula would hold. In our case, Q is the closure of its interior Q . This latter property 
is enough for the validity of Ito’s formula. In particular, the basic tools for the proof 
of Ito’s formula are the following Taylor expansions for the function /: 

f{d, m, x) = f{t, m, x) -h dtf{t, m, x){t' — t) 

+ [ dtf{t-\-Ht' — t),m,x){t' — t)dX, 

Jo 

f{t, m' , x) = f{t, m, x) -h dmf{t, ni, x){m' — m) 

+ / dfnf{t,rn-\-k{m' — m),x){m' — m)dk, 

Jo 

f{t, m, x') = f{t, m, x) -h dxf{t, m, x){x' — x) -h ~ 

-h [ {I — k)(d^^f{t, m, X k{x' — x)) — d^^f{t, m, x)){x' — x)^dk, 

Jo 

for all {t, m, x) e [0, T] X 2- To prove the above Taylor formulae, note that they 
hold on the open set Q, using the regularity of /. Then, we can extend them to the 
closure of Q, since / and its derivatives are continuous on Q. Consequently, Ito’s 
formula can be proved in the usual way. □ 
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Even though, as already observed, ^ does not belong to C^’^(([0, T[xpast) x 
present) nC([0,r]xC([— T, 0])),so that it can not be a classical solution to Eq. (43), 
the function / is a solution to a certain Cauchy problem, as stated in the following 
proposition. 

Proposition 10 The function f defined in (54) solves the backward heat equation: 



Proof We provide two distinct proofs. 

Direct proof Since we know the explicit expression of /, we can derive the form 
of dtf and d^^f by direct calculations: 



for all (t, m, x) e [0, T[ X 2, from which the claim follows. 

Probabilistic proof By definition, the process (/(C St, Bt))te[0,T] is given by: 



so that it is a uniformly integrable F-martingale. Then, it follows from Ito’s formula 
(56) that 



for all 0 < t < T, P-almost surely. As a consequence, the claim follows if we prove 
that 



dtf(t,m,x)-\-^d^^f(t,m,x) = 0, 
f(T,m,x) = m, 



V (t, m, x) G [0, r[x Q, 

V (m, x) G Q. 




fit, Sr, B,) = E[5r|#-f], 





(57) 



By direct calculation, we have 





Therefore, (57) becomes 






( 58 ) 
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Now we observe that the local time of Ss — Bs is equal to 2Ss , see Exercise 2. 14 in [35] . 
It follows that the measure dSs is carried by [s : Ss — Bg =0}. This in turn implies 
the validity of (58), since the integrand in (58) is zero on the set {s : Ss — Bg =0}. 

□ 



3.3 Strong-Viscosity Solutions 

Motivated by previous subsection and following [10], we now introduce a concept of 
weak (viscosity type) solution for the path-dependent Eq. (43), which we call strong- 
viscosity solution to distinguish it from the classical notion of viscosity solution. 



Definition 21 A function ^ : [0, T] x C([— T, 0]) ^ M is called strong-viscosity 
solution to Eq. (43) if there exists a sequence ("%, Hn, Fn)n of Borel measurable 
functions : [0, T] x C([-T,0]) ^ M, : C([-T,0]) ^ M, : [0, T] x 
C([— r, 0]) X M X M ^ M, satisfying the following. 

(i) Eor all r G [0, r], the functions Fn{t, diVt equicontinuous 

on compact sets and, for some positive constants C and m, 

\Fnit, T], y, z) - Fn(t, r], y, z')\ < C(\y - y'\ + |z - z'|), 
ri)\ + \Hnm + \Fn(t, 11, 0, 0)| < C(l + MZ), 

for all (C T]) G [0, T] x C([— T, 0]), y, y' G M, and z, z' c M. 

(ii) is a strict solution to 

+ Fn(t, T], ^n) = 0, V (C 7]) G [0, r[xC([-T, 0]), 

' ^n(T,ri) = Hniri), V ^ G C([-T, 0]). 

(hi) (^„, Hn, Fn) converges pointwise to , H, F) as n tends to infinity. 

Remark 1 2 (i) Notice that in [8] , Definition 3 .4, instead of the equicontinuity on com- 
pact sets we supposed the local equicontinuity, i.e., the equicontinuity on bounded 
sets (see Definition 3.3 in [8]). This latter condition is stronger when ^ (as well as 
the other coefficients) is defined on a non-locally compact topological space, as for 
example [0, T] x C([— T, 0]). 

(ii) We observe that, for every t G [0, T], the equicontinuity on compact sets of 
{t, '))n together with its pointwise convergence io^{t, •) is equivalent to requir- 
ing the uniform convergence on compact sets of C%(^, -))n to ^{t, •)• The same 
remark applies to (if„(-))n and {Fn{t, •, •, '))n, ^ G [0, T]. □ 

The following uniqueness result for strong- viscosity solution holds. 
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Theorem 6 Suppose that Assumption (A) holds. Let ^ : [0, r]xC([— T, 0]) ^ M 
he a strong-viscosity solution to Eq. (43). Then, we have 

rj) = Yp, V it, ri) e [0, T] x C{[-T, 0]), 

where e T) x M^it, T), with yI'^ = {s, solves the 

backward stochastic differential equation, '^-a.s.. 



Yl'^ = H(W‘y) + / F(r,Wl.’^, Yp, Z*;^)dr - / Z*;^dWr, t <s <T. 

J S J S 

In particular, there exists at most one strong -viscosity solution to Eq. (43). 

Proof Consider a sequence ("%, Hn, En)n satisfying conditions (i)-(iii) of Definition 
21. For every n eN and any (t, q) e [0, T] x C([—T, 0]), we know from Theorem 
4that(Yp’\zy’\^^t.T] = i^n(s,Wl’’'), e S'^(t,T) x 

T) is the solution to the backward stochastic differential equation, P-a.s., 



= H„ 



) + F„(r, Z"’‘’”)dr - Z^/’^ldWr 



t < s < T. 



Our aim is to pass to the limit in the above equation as n ^ oo, using Theorem C.l 
in [10]. From the polynomial growth condition of (^n)n and estimate (46), we see 
that 



sup < 00 , foranyp>l. 

n 

This implies, using standard estimates for backward stochastic differential equations 
(see, e.g.. Proposition B.l in [10]) and the polynomial growth condition of (En)n, 
that 



sup||Z"’^’''||H2(f_r) < 00. 
n 

Let Ys’^ = ^ (s, Wj’^), for any 5' G [t, T]. Then, we see that all the requirements of 
Theorem C.l in [10] follow by assumptions and estimate (46), so the claim follows. 

□ 

We now prove an existence result for strong-viscosity solutions to the path- 
dependent heat equation, namely to Eq. (43) in the case E = 0. To this end, we 
need the following stability result for strong- viscosity solutions. 

Lemma 4 Let (^n,k, Hn,k, Fn,k)n,k, (^n, Hn, En)n, (^, H, E) be Borel measur- 
able functions such that the properties below hold. 

(i) Eorall t G [0, T\ the functions ^n,k(L f, cind F^,y^(L n,k 

are equicontinuous on compact sets and, for some positive constants C and m. 
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\Fn.kit, r], y, z) - Fn,k(t, rj, y' , z')\ < C{\y - y'\ + |z - z'\), 
\‘^n,kit, r^)\ + \Hn,kiri)\ + \Fn,kit, ri, 0, 0)| < C(l + ||??||™ ), 

for all {t, rj) e [0, T] x C{{—T, 0]), y, y' e M, and z, z! e R. 

( ii) ‘f^n,k is a strict solution to 

^t‘^n,k + F>^f^n,k + \F)^^ ‘^n,k 

■ +Fn,k{t,T],f/n,k,D^f/n,k) = 0, V(f,/ 7 ) e [0,r[xC([-r,0]), 

f/n,kiT,ri) = Hn,kir}), V e C([-r, 0]). 



(in) Fn,k) converges pointwise to Hn, Fn) as k tends to infinity, 

(iv) Hyi, Fyi) converges pointwise to , H, F) as n tends to infinity. 

Then, there exists a subsequence (^n,kn^ ^n,kn^ Fn,kn)n which converges pointwise 
to , H, F) as n tends to infinity. In particular, ^ is a strong-viscosity solution to 
Eq. (43). 

Proof See Lemma 3.4 in [8] or Lemma 3.1 in [10]. We remark that in [8] a slightly 
different definition of strong- viscosity solution was used, see Remark 12(i); however, 
proceeding along the same lines we can prove the present result. 

Theorem 7 Suppose that Assumption (A) holds. Let F = 0 and H be continuous. 
Then, there exists a unique strong -viscosity solution ^ to the path-dependent heat 
Eq. (43), which is given by 

= E[//(Wy'')], V(r,??) € [0, r] X C([-r,0]). 



Proof Let (e,),>o be the orthonormal basis of 0]) composed by the func- 

tions 




e 2 i-\(.x) 




{^(x + T)fi 



eiiix) = 







for all / G N\{0}. Let us define the linear operator A: C([— L, 0]) ^ C([—T, 0]) by 



n(0) - n(-T) 

(Aq)(x) = ^ e 0], q G C([-T, 0]). 

Notice that (q — Aq)(—T) = (q — Aq)(0), therefore q — Aq can be extended to the 
entire real line in a periodic way with period T , so that we can expand it in Fourier 
series. In particular, for each n eN and q G C([— L, 0]), consider the Fourier partial 
sum 

n 

n(r]-Ar]) = - (Ar])i)ei, W r] G C([-T,0]), 

i=0 






( 59 ) 
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where (denoting ei{x) = ei(y)dy, for any x G [-T, 0]), by Proposition 4, 



m 



/ T](x)ei(x)dx = T](0)ei(0) - / ei{x)d r](x) 
J-T J[-T,0] 



n-T,o] 



(ei(0) - ei(x))d r]{x), 



(60) 



since r](0) = Q^d r](x). Moreover we have 



-L 



(Ar])i = I {Ar]){x)ei{x)dx = ^ [ xei{x)dx 



(/ ' 

\ J[-r,o] 



d r](x) - r]{-T) 



(61) 



Define 






+ • • • + 

n I 



Then, by (59), 



Tl I 

aniri-Aii) = T ” ~\ rji-(A,i)i)ei, V rj e C([-T,0]). 

We know from Fejer’s theorem on Fourier series (see, e.g.. Theorem 3.4, Chapter 
III, in [44]) that, for any r] g C([— T, 0]), cfnir] — At]) ^ r] — Arj uniformly on 
[— r, 0], as ^ tends to infinity, and \\an{r] — At])\\oq < \\r] — Ar]\\^. Let us define the 
linear operator : C([—T, 0]) ^ C([—T, 0]) by (denoting e-i(x) = x, for any 
X G [-T, 0]) 



Z n 1 — i ii\\j ) — ii\— 1 ) 

—;—(rh - (Ar])i)ei H e_i 

. „ n + 1 T 



r]{Q) - r]{-T) 



1=0 



1=0 



Z fl \ — I 

Xi6i -\- x-\e-i, 

n + l 



(62) 



where, using (60) and (61), 



x_i = 



/ ? 
( 

J\_-TM \ 



—d T](x) - L(-r), 



e, (0) - ei(x) - Y I 



f '?(-*^) + ^ j ^xei(x)dxr]{-T), 
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for / = 0, . . . , n. Then, for any t] g C([—T, 0]), TnT] t] uniformly on [-T, 0], as 
n tends to infinity. Furthermore, there exists a positive constant M such that 

I|T;^||oo < Mll^lloo, V^gN, V^GC([-r,0]). (63) 



In particular, the family of linear operators (Tn)n is equicontinuous. Now, let us 
define Hn : C([— T, 0]) ^ M as follows 



Hnir]) = H{Tnr]), V ^ G C([-T, 0]). 



We see from (63) that the family {Hn)n is equicontinuous on compact sets. Moreover, 
from the polynomial growth condition of H and (63) we have 

\Hnm < C(i + ||r,^||-) < C(1 + M-||^||-), v^gN, v^GC([-r,0]). 

Now, we observe that since {^_i, e\, . . . , en] are linearly independent, then 

we see from (62) that Tnt] is completely characterized by the coefficients of 
e-i,eQ,ei , . . . , Therefore, the function hn : M given by 



hn (V— 1 



...,x„) = Hn(r]) = H\ > ——Xiei+x-ie-i], 

” + i / 



V(x_i,...,x„) G 



T)n+2 



completely characterizes Hn. Moreover, fix^ g C([— T, 0]) and consider the cor- 
responding coefficients v_i , . . . , with respect to {^_i , . . . , in the expression 
(62) of Tnif]. Set 

1 1 /'O 

(p-\{x)= (Pi{x) = e/(0) - e/(x - r) - - / xei(x)dx, x€[0,T], 

1 1 /"O 

a-i = , at = — xei(x)dx. 

T T 



Notice that cp-\, . . . ,(pn c C^([0, T]). Then, we have 



Hn(r])=hn( [ (p-i(x + T)d r](x)+a-ir](-T), 
\J[-T,0] 



■i 



[-r,0] 



(Pn(x + T)d r](x) +anT](-T) 



)■ 



Let (p(x) = cexp(l/(v^ — r^))l[o,r[(v), v > 0, with c > 0 such that (p(x) 
dx = 1. Define, for any s > 0, (ps(x) = (p(x/£)/s, x > 0. Notice that 0^ g 
C^([0, oo[) and (denoting 0e(v) = Jq (j)e{y)dy, for any v > 0), 



[ T](x)(ps(x -\-T)dx = t](0)^s(T)- [ ^s(x-\-T)d r](x) 

l-T d[-T,0] 



= [ {MT)-Mx + T))d-v(x). 

d\-T,0^ 
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Therefore 

lim [ (4>s(T) - ^s(x -\- T))d~r](x) = lim [ r](x)(l)s(x -\- T)dx = r](-T). 

£->0+ J[-T,0] £-^0+ J-T 

For this reason, we introduce the function Hn '. C([— T, 0]) ^ M given by 
finir]) = [ (piix -\-T)d~r]ix) -\-ai [ (0„(T) - 0„(x + r))<7“?y(x), . . . 

\ J[-T,0] J[-T,0] / 

Now, for any n e N, let (hn,k)keN be a locally equicontinuous sequence of 
^2(]^n+2; functions, uniformly polynomially bounded, such that hn,k converges 
pointwise to /z„, as k tends to infinity. Define Hn,k • C([—T, 0]) ^ M as follows: 

= hn,k( • • • , I (pi{x-\-T)d~r]ix)-\-ai [ (0„(T) - + r))j“?y(x), . . . 

\ J[-T,0] J[-T,0] / 

Then, we know from Theorem 5 that the function ^n,k • [0, T] x C([—T, 0]) ^ M 
given by 



n) = V it, / 7 ) e [0, T] x C{{-T, 0]) 

is a classical solution to the path-dependent heat Eq. (43). Moreover, the family 
C^n,k)n,s,k is equicontinuous on compact sets and uniformly polynomially bounded. 
Then, using the stability result Lemma 4, it follows that is a strong-viscosity 
solution to the path-dependent heat Eq. (43). □ 
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Nonlinear Young Integrals via Fractional 
Calculus 



Yaozhong Hu and Khoa N. Le 



Abstract For Holder continuous functions W(t,x) and cpt, we define nonlinear 
integral W(dt,(pt) via fractional calculus. This nonlinear integral arises naturally 
in the Feynman-Kac formula for stochastic heat equations with random coefficients 
(Hu and Le, Nonlinear Young integrals and differential systems in Holder media. 
Trans. Am. Math. Soc. (in press)). We also define iterated nonlinear integrals. 

Keywords Nonlinear integration • Young integral • Iterated nonlinear Young 
integrals 



1 Introduction 

Let [cpt > 0} be a Holder continuous function and let {W(t,x),t > 0 , x G M.^} 
be another jointly Holder continuous function of several variables (see (10) for the 
precise statement about the assumption on W). The aim of this paper is to define the 
nonlinear Young integral W (dt, ept) hy using fractional calculus. 

This paper can be considered as supplementary to authors’ recent paper [5], where 
the nonlinear Young integral is introduced to establish the Feynman-Kac formula for 
general stochastic partial differential equations with random coefficients, namely, 

dtu(t, x) + Lu(t, x) -h u(t, x)dtW(t, x) = 0 , (1) 
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where W is a Holder continuous function of several variables (which can be a sample 
path of a Gaussian random field) and 

^ d d 

Lu(t, x) = - CLij(t, X, x) + x, W)dxiU(t, x) 

^ ij=l ' ' i=l 

with the coefficients atj and bi depending on W . The terminal condition for the 
Eq. (1) is u(T, x) = ut{x) for some given function ut{x). 

To motivate our study of the nonlinear Young integral, let us recall a basic result 
in [5] on the Feynman-Kac formula: Let a{t,x) = {aij{t,x, yV))\<ij<d satisfy 
a(t, x,W) = a(t, X, W)a(t, x,wY (we omit the explicit dependence of a on W). 
Consider the following stochastic differential equation 

= a(t, + b{t, 0 < r < r < T, = x, (2) 



where (5^,0 < t < T)isa standard Brownian motion and 8Bt denotes the 
Ito differential. Then it is proved in [5] that under some conditions b and a and 
W (which are verified for certain Gaussian random field W), the nonlinear inte- 
gral W(ds, is well-defined and exponentially integrable and u(r, x) = 



E® I 



exp 



jj W (ds, j I is a Feynman-Kac solution to (1) with w(T, x) 



= ut(x). One of the main tasks in that paper is the study of the nonlinear Young 



integral W(ds, To this end we used the Riemann sum approximation and 
the sewing lemma of [2]. In this paper, we shall study the nonlinear Young inte- 
gral W (dt, (pt) hy means of fractional calculus. This approach may provide more 
detailed properties of the solutions to the equations (see [6, 7]). 

Under certain conditions, we shall prove that the two nonlinear Young integrals, 
defined by Riemann sums (through sewing lemma) or by fractional calculus, are the 
same (see Proposition 2). 

To expand the solution of a (nonlinear) differential equation with explicit remain- 
der term we need to define (iterated) multiple integrals (see [3]). We shall also give a 
definition of the iterated nonlinear Young integrals. Some elementary estimates are 
also obtained. 

The paper is organized as follows. Section 2 briefly recalls some preliminary 
material on fractional calculus that are needed later. Section 3 deals with the nonlinear 
Young integrals and Sect. 4 is concerned with iterated nonlinear Young integrals. 



2 Fractional Integrals and Derivatives 

In this section we recall some results from fractional calculus. 

Let —00 < a < b < CO, a > 0 and p > 1 be real numbers. Denote by LP(a, b) 
the space of all measurable functions on (a , b) such that 
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ll/llp := If \f(t)\>’dt 






b 




< 00 . 



Denoteby C([( 2 , b]) the space of continuous functions on [a, b]. Let / G ([a,b]). 
The left-sided fractional Riemann-Liouville integral /“+/ is defined as 



and the right-sided fractional Riemann-Liouville integral f is defined as 



where (— 1)~“ = and F (a) = r^~^e~^dr is the Euler gamma function. 

Let (resp. 1^_{LP)) be the image of LP{a, b) by the operator (resp. 

If / c 7“^ {LP) (resp. / G 7^_ (L^)) and 0 < Ck: < 1, then the (left-sided or 
right-sided) Weyl derivatives are defined (respectively) as 



where a < t < b (the convergence of the integrals at the singularity s = t holds point- 
wise for almost all ^ g (a, b) if p = 1 and moreover in L^-sense if 1 < p < oo). 

It is clear that if / is Holder continuous of order p. > a, then the two Weyl 
derivatives exist. 

For any ^ g (0, 1), we denote by b]) the space of ^0-Holder continuous 

functions on the interval [a,b]. We will make use of the notation 




( 3 ) 



a 



b 




( 4 ) 



and 






a<r<b 



(which is a seminorm) and 
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where / : M ^ M is a given continuous function. 

It is well-known that C^([a, b]) with the Holder norm \\f\\^;a,b + \\f\\oo;a,b is a 
Banach space. However, it is not separable. 

Using the fractional calculus, we have (see [9] and also [3]) 

Proposition 1 Let 0 < a < 1. If f and g are continuously differentiable functions 
on the interval [a, b\ then 

b b 

J fdg = (-1)“ J (£>“+/ (r)) (olz^gb- (o) dt, (7) 

a a 

where gb- (t) = g (t) - g (b). 

In what follows k denotes a universal generic constant depending only on A., r, o' 
and independent of W, (p and a,b. The value of k may vary from occurrence to 
occurrence. 

For two function f,g:[a,b]^R, we can define the Riemann-Stieltjes integral 
fa f(i)dg{t). Here we recall a result which is well-known (see for example [3, 9] 
or [6, 7]). 

Lemma 1 Let f and g be Holder continuous functions of orders a and f respec- 
tively. Suppose that a ^ > 1. Then the Riemann-Stieltjes integral f(t)dg(t) 
exists and for any y ^ a), we have 

b b 

j f{t)dg{t) = (-1)’' j Dlf{t)Dl7 gb-{t)dt . (8) 

a a 

Moreover, there is a constant k such that 



b 

j f(t)dg(t) 

a 



< K\\g\\i};a,bi\\f\\oo-.a,b\b 



-a\f^ 



+ 



a-a,b\b-ar+f^). 



(9) 



Proof We refer to [9] or [3] for a proof of (8). We shall outline a proof of (9). Let y 
be such that a > y > \ — f. Applying fractional integration by parts formula (8), 
we obtain 



b 

J f(t)dg{t) 



b 

< j \Dl^f{t)Dl~J gj,^{t)\dt. 



a 



a 



From (5) and (6) it is easy to see that 

\d 17 gb-{t)\ < K\\g\\p-a,b{b - 
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and 

\Dl+f{t)\ < K[\\f\U,a,b{t - + Wf\\a-,aAt ~ 



Therefore 

b 



f(t)dg{t) 



<K\\g\\p-,a,by\f\\oo-,a,b J(t-a) ^dl 

+\\f\\a-,a,b jit- ar-y(b - t/+y-Ut ’ 



The integrals on the right hand side can be computed by making the substitution 
t = b — (b — a)s. Hence we derive (9). 

We also need the following lemma in the proofs of our main results. 

Lemma 2 Let f{s,t), a < s < t < b be a measurable function of s and t such that 



II 



(t -sf-^ 



dsdt < 00. 



Then 



b b 



Proof An application of Fubini’s theorem yields 

a a a 

b b 



-mil 



fjs, t) 



fjs, t) 

(r 



dsdt 



dtds 



a s 
b 



i-ir j C-fitj%'=tdt 



which is the lemma. 
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3 Nonlinear Integral 



In this section we shall use fractional calculus to define the (pathwise) nonlinear 
integral j^W(dt,(pt). This method only relies on regularity of the sample paths 
of W and (p. More precisely, it is applicable to stochastic processes with Holder 
continuous sample paths. 

Another advantage of this approach is that in the theory of stochastic processes 
it is usually difficult to obtain almost sure type of results. If the sample paths of the 
process is Holder continuous, then one can apply this approach to each sample path 
and almost surely results are then automatic. 

In what follows, we shall use W to denote a deterministic function W : M x ^ 
M^. We make the following assumption on the regularity of W : 

(W) There are constants r, A. g (0, 1] such that for all finite a < b and for all 
compact sets K of the seminorm 



\\W\\r,XM 

I W(^, x) - W(t, x) - W(s, y) + W(t, y)\ 

— ^ 

a<s<t<b k y| 

x,yeK;x^y 

\W(s,x) - W(t,x)\ \W(t,y) - W(t,x)\ 

+ sup h sup ^ 

a<s<t<b 1^ a<t<b 

xeK x,yeK;x^y 



, ( 10 ) 



is finite. 

About the function (p, we assume 

(0) (p is locally Holder continuous of order ]/ g (0, 1]. That is, the seminorm 



\W\\Y\a,b 



sup 

a<s<t<b 



\(p{t) - (;p(5)| 

\t-s\y 



is finite for every a < b. 

Among the three terms appearing in (W), we will pay special attention to the first 
term. Thus, we denote 



1^]t, X; a,b,K 



I W(^, x) - W(t, x) - W(s, y) + W(t, y)| 
sup ^ 

a<s<t<b 1^ iS’l^lv y| 

x,yeK;x^y 



If a, b is clear from the context, we frequently omit the dependence on a, b. In 
addition, throughout the paper, the compact set K can be chosen to be any compact set 
containing the image of (p on the interval of integration. Thus we omit the dependence 
on K as well. For instance, || W||r,;i is an abbreviation for \\W\\xx,a,b,K^ \W\\y is an 
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abbreviation for \\(p\\y-a,b and so on. We shall assume that a and b are finite. Thus it 
is easy to see that for any c G [a,b] 

sup \(p{t)\= sup |^(c) +^(r) - ^(c)| < |^(c)| + ll^llyiz? - al’' < 00 . 

a<t<b a<t<b 

Thus assumption (0) also implies that 



\W\\oo\a,b '■= sup \(p{t)\ < OO. 

a<t<b 



Remark 1 Given a stochastic process indexed by (t,x), it is possible to obtain 
almost sure regularity of the type (10) by a multiparameter Garsia-Rodemich-Rumsey 
inequality. Indeed, this has been explored in [4], see also the last section of [5]. 

One of our main results in this section is to define W(dt,(pt) under the condition 

Xy T > 1 through a fractional integration by parts technique. The following 
definition is motivated from Lemma 1 . 

Definition 1 We define 

b b 

J W{dt,cpr) = i-ir J D^^{DlZ“’‘Wb-it,W)\t'=tdt ( 11 ) 

a a 

whenever the right hand side makes sense. 

Remark! Assume J = 1. Let W{t,x) = g{t)x be of the product form and let 
(p(t) = f{t), where g is a Holder continuous function of exponent r and / is a 
Holder continuous function of exponent A. Ifl — r < a <A, then 



/ 



W(dt, (Pt) 



a 



b 

(- 1 )“ j Dl(DlZ^’^Wb-{t, 



a 

b 

(-I f j DlZ“’’gb-{t)D^’lf{t)dt. 

a 



Thus from (8), W(dt, is an extension ofthe classical Young integral f(t)dg(t) 

(see [3, 8, 9]). For general d, if W(t,x) = ^f=igi(t)xi and (pi(t) = 



fi (0, then it is easy to see that W(dt,(pt) 




fi(t)dgi(t). 



The following result clarifies the context in which Definition 1 is justified. 
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Theorem 1 Assume the conditions (W) and (0) are satisfied. In addition, we sup- 
pose that ky -\-T > 1. Let a G (1 — Then the right hand side of (1 1) is finite 

and is independent of a G (1 — x,X). As a consequence, we have 



/ 



Widt, ipt) 



a 



= (-i)‘ 



o 

i 



7 ^ 1 — 0(,t 



Wb-{t, (Ptf)\y=tdt 



1 

r(a)F(l-a) 



/ 



Wb-(t,cpt) 

(b — tf~^(t — a)^ 



dt 



-// 



Wb-{t,(pt)-Wb-{t,cpr) 
(Z. - - r)“+l 



+ (1 



b b 




a t 



Wit, cpr) - l¥(^, (Pt) ^ 

;; dsdt 

is - t)^-^ it - aY 



a a t 



Wjs, cpt) - Wjt, cpr) + Wjs, cpr) 
is - tY~»it - r)“+l 



dsdrdt 



( 12 ) 



where Wb- it, x) = W it, x) — W ib, x). Moreover, there is a universal constant k 
depending only on r, X and a, but independent W, <p and a, b such that 

Widt, (ft) 

a 

where || VK||r,A;a,^ = \\^\\x,k\a,b,K K is the closure of the image of((pt,a < 
t <b). 

Proof We denote II W II = ||W||r,;^;a,&- First by the definitions of fractional derivatives 
(5) and (6), we have 



<K\\Wh^X-,a,b{b- 



-ay+K\\W\\r,x-,a,b\Wty,ayb- 



-a) 



T+ky 



(13) 



DlY‘'*Wb-it,w) 



(-!)*-“ 

rice) 



I Wb-it,Vt’) 

1 ib-ty-‘^ 



+ (1 



-.)/ 



is - 



ds 
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and 






(- 1 ) 



1— O' 



Wb-{t,w) 

r(a)r(l-a) I (t’-ar 



+ 



f Wb-{t,<P,' 

“7 (t'-rr 



)-Wb-{t,<Pr) 



- r)“+l(Z? - t)^~ 



dr 



I -a f W(t, <p,,) - (pt>) 



{f — aY J (s — 






W(t, CPt>) - W(s, CPt>) - Wit, CPr) + W{s, CPr) 
{s - 



dsdr . 



Thus the right hand side of (1 1) is 



1 



r{a)r{\ - a) 
b b 

+ {\-a) 



Wb-{t,<pt) 



'// 



(Z?-0 



b t 

'JI 



Wb-(t,<Pt)-Wb-(t,(Pr) 



dt + a / ; n — drdt 

' ' (b-ty-‘^(t -rr+^ 



(j — t)^ “(f — 

a t 

b t b 



dsdt 



'JII 



+ a{\ — a) 

a a t 

=: I\ + h + h + h ‘ 



W(C q)t) - q)t) - W{t, ipr) + ipY 
{s -rY^^ 



dsdrdt 



( 14 ) 



The condition (W) implies 



b 

h<K\\W\\ -a)~“dt 

a 



= K\\W\\(b-ay . 



( 15 ) 



Similarly, we also have 



b b 

/3 <^iiwii j jy- 

a t 



<K\\W\\(b-ay . 



r)"+“-2(r 



a) “dsdt 



( 16 ) 
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The assumptions (W) and (0) also imply 

|Wfo_(r, ipr) - Wb-(t, iPr)\ < K\\W\\\b - t\^\ipt - 

<K\\W\\\\ipt^\b-t\At-r\^y . 



This implies 



b t 



/2</c||W||||^||: 



J J(b-ty+‘=‘-^{t 



drdt 



<K\\w\\\\<p\t{b-ay+^y . 



(17) 



Using 

\W{t, <pt) - W(S, (ft) - W(t, <Pr) + <Pr)\ < ^11 II 119^11^ |r -s\Ai - r\^y , 



we can estimate I 4 . as follows. 



h 



< 'file'll II II?; / / / — — — ^-^^^dsdrdt 

a a t 



<^||M'||||^|i;)a.-a)^+^^ 



(18) 



The inequalities (15)-(18) imply that for any a G (1 — r, y A), the right hand side of 
(11) is well-defined. The inequalities (15)-(18) also yield (13). 

To show (12) is independent of a we suppose a' , a G {\ — T,'ky),a' > a. Denote 
^ = a' — a. Using Lemma 2, it is straightforward to see that 

b 

(-1)“ J DliDlZ^'’'Wb-{t. %')\t'=,dt 

a 

b 

= (-I f j I^:lD^yD‘^AlDlZ‘^’''Wb-{t,w)\,'=tdt 

a 

b 

= (-l)“+^y Wb-{t,w)\t'=tdt 

a 

b 

= J Di^i^'_!^Dyywb-{t,<Pt')\,'=,dt 

a 

b 

= (- 1 )“'/ D^^^DlZ‘^''''Wb-{t,w)\,'=tdt. 
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This proves the theorem. 

Now we can improve the equality (13) as in the following theorem 

Theorem 2 Let the assumptions ( W) and (0) be satisfied. Let a, b, c be real numbers 
such that a < c < b. Then there is a constant k depending only on z,X and a, but 
independent W, (p and a, b, c such that 



Proof Let a < c < d < b and let 0(0 = (p(c)x[c,d)(i)^ where X[c,d) is the indicator 
function on [c,d). Then 



This means W(t, 0(0)) = W(t, (p(c))x[c,d)(t'). Hence, from (8) we have 



Let c be any point in [a, b]. Denote W(t,x) = W(t, x) — W(t, (pc). Then W 
satisfies (W). As in the Eq. (14), we have 



where h = h and O = O are the same as O and O in the proof of Theorem 1 . But 



b 



/ 



W{dt, cp,) - W(b, <Pc) + W{a, cpc) < K\\Wh,x-,a.bMy-,a,b(^ ~ • 



a 



(19) 



W(t, (p(c)) c < t' < d 



W(t,(p(t')) = 



W(t,0) elsewhere. 



b 



b 




a 



a 



b 




a 



= W(d, (p(c)) - W(c, (p(c )) . 



b 



b 




a 



a 



= 7l + /2 + 0 + 4 . 
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~ ^ 1 f W{t, (ft) - W{b, cp,) - W{t, (Pc) + W{b, (Pc) 

‘ F{a)r{\-a)J {b-tY-<^{t -aT 

a 

/ = (l-«) f f W{t, cp,) - W{s, cp,) - W{t, cp,) + 

^ r{a)r{l-a)J J {s-t)^-»{t-ar 

a t 

From the assumptions (W) and (0) we see that 

\W{t^ ipt) - ipt) - ipc) + W{b^ ipc)\ 

SK\\W\\,M\\<P\\y-,a,b\b-t\'^\t-c\^^ 
SK\\Wh,x-,aAv\\y-,a,b\b-t\^\t-a\^^ ■ 

This implies that 

h<K\\W\\rM\Wty,a,bib-ay+^y . (20) 



Similarly, we have 



hSK\\Wh^X-,a,b\My-,a,b(b-ay+^^ . 



( 21 ) 



Combining these two inequalities (20) and (21) with the inequalities (17) and (18) 
we have 



/ 



W{dt, (Pt) 



a 



< K\\W\\rMMy-,a,b(^ - ' 



which yields (19). 

Theorem 3 Let the assumption (W) be satisfied. Let (p \ [a, b] ^ satisfy 

\(p(s) — (p(a)\ < L\s — af Ws e [a, b] and sup ^ ^ 

a<t<s<b {s - t)y - 



( 22 ) 

for some £ € (y, oo) and for some constant L e (0, oo). Ifz+Xy > 1, then for any 
^ 1 -)_ >^y+r-i g have 



/ 



W{dt,q>t)-W{b,q)a) + W{a,q)a) 



a 



<C{b-af , 



(23) 



here the constant C does not depend on b — a. 

Proof As in the proof of Theorem 2 we express W {dt, (pt) — W{b, (pa)^-W {a, (pa) 

as the sum of the terms Ij, j = 1, 2, 3, 4 (we follow the notation there). First, 
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we explain how to proceed with U. We shall use C to denote a generic constant 
independent of b — a. Denote 

J ■- \W{t, <pt) - W(^, (ft) - Wit, cpr) + IT(S, cpr)\ 

First, we know that we have 



J <C\t - s\^\t -r\^^ . (24) 

On the other hand, we also have 



J < \Wit, q>,) - lF(s, q>t) ~ W(t, ipa) + lF(s, <Pa)\ 

+ \W(t, q>r) - W(S, q>r) - Wit, <Pa) + W(^, <Pa)\ 



< cit -^r 

< C\t — |t — 



t -a\^^ + \r -a\^^ 



(25) 



when a<r<t<s<b. Therefore, from (24) and (25) it follows that for any 
Pi >0 and > 0 with + ^2 = 1, we have 

J < C\t -s\^\t 



If we choose a and P\ such that 



r + O' > 1 , Pi^y — a > 0 



(26) 



then 



I 4 < Cib - _ 



For any /I < 1 + ^ we can choose a, and /I 2 such that (26) is satisfied 

and 

4 < Cib - af . 

The term I 2 can be handled in a similar but easier way and a similar bound can be 
obtained. 

Now, let us consider . We have 

\wit, cpt) - lF(s, cp,) - Wit, <Pa) + IF(S, ,pa)\ < C|t - ^r|t - . 

This easily yields 

h < Cib - . 

A similar estimate holds true for I\ . However, it is easy to verify r + > 1 + 

^ ^ theorem is proved. 
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Next, we show that the nonlinear integral defined in Definition 1 coincides with 
the limit of Riemann sums. For this purpose, we need some preliminary set up. For 
every 5 ’, t in [a, /?], we put t) = W{t, (ps) — Let n = {a = to < t\ < 

. . . < = /?} be a partition of [a, h] with mesh size |7r| = maxi</<„ \ti — 

One can consider the limit of the Riemann sums 



lim 

l = l 



whenever it exists. A sufficient condition for convergence of the Riemann sums is 
provided by following two results of [2], see also [1] for a simple exposition. 

Lemma 3 (The sewing map) Let /jl he a continuous function on [0, with values 
in a Banach space B and £ > 0. Suppose that /x satisfies 

|/x(< 2 , b) — pi(a, c) — /x(c, b)\ < K\b — 'i Q < a < c <b <T . 



Then there exists a function ^ pi (t) unique up to an additive constant such that 
\/ ix{b)- / ix{a)-tx{a,b)\ < K{\-2-W\b-a\^+^ <a <b <T . (27) 
We adopt the notation ^ tx{b) — ^ ix{a) 

Lemma 4 (Abstract Riemann sum) Let jt = {a = to < t\ <• - < tm = b} be an 
arbitrary partition of [a, b] with Itt | = sup^-^Q —ti\. Define the Riemann 

sum 

m — l 

Jn — ^ U+\) 

then Jjj; converges to as \7t\ 0 . 

Because r Xy is strictly greater than 1, the estimate (19) together with the 
previous two Lemmas implies 

Proposition 2 Assume that (W) and (0) hold with Xy z > 1. As the mesh size 
Itt I shrinks to 0, the Riemann sums 

n 

i = l 

converges to W(dt, cpt). 

Remark 3 In [5] , the authors define the nonlinear integral J W(dt, (pt) via the sewing 
Lemma 3. The previous proposition shows that the approach using fractional calculus 
employed here produces an equivalent definition. Let us note that this is possible 
because of the key estimate (19) and the uniqueness part of the sewing Lemma 3. 
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It is easy to see from here that 



/ 



W{dt, (ft) 



a 



c b 

J W(dt,^t) + J W(dt,(pt) 



V a < c < b . 



This together with (13) imply easily the following. 

Proposition 3 Assume that (W) and (0) hold with Xy x > As a function 
of t, the indefinite integral | /J W(ds, (ps) , ^ < a < Z?| is Holder continuous of 
exponent x. 

Further properties can be developed. For instance, we study the dependence of 
the nonlinear Young integration / W (ds, cps) with respect to the medium W and the 
integrand (p. We state the following two propositions whose proofs are left for readers 
(see, however, [5] for details). 

Proposition 4 Let W\ and W 2 be functions onRxM.^ satisfying the condition ( W). 
Let ip be a function in (M; M'^) and assume that x -\- ky > 1. Then 



b b 

I j Wi(ds,,ps)- J W2(ds,<Ps)\<\Wi(b,<Pa)-Wi(a,<Pa)-W2(b,<Pa) + W2(a,,pa)\ 

a a 

+ c(||^||oo)[Wi - W2]p,r,x\\(p\\y\b - . 

Proposition 5 Let W be a function onRxM.^ satisfying the condition (W). Let ip^ 
and ip^ be two functions in C^(M; M^) and assume that x -\- ky > 1. Let 0 G (0, 1) 
such that X + Oky > 1. Then for any u < v 



V V 

I J W(ds,<pl)- J W(ds,<p^)\ 

u u 

s cdwh,x\\P - - u\^ 

+ C2[W]r,x\\P - , 



where C\ is an absolute constant and C 2 = 2* ^Ci(||^^||^ + ||^*||^)^. 



4 Iterated Nonlinear Integral 



From Remark 2 we see that if W(t,x) = 2;=i (pi(t) = fi(t), then 

" } 

/; w(dt, cpt) = Z fi (t)dgi {t). We know that the multiple (iterated) integrals of 
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the form 



/ 



, ^2, • • • , Sn)dg{si)dg{S 2 ) ■ ■ ■ dg{s„) 



a<si<S2<---<Sn<b 



are well-defined and have applications in expanding the solutions of differential 
equations (see [3]). What is the extension of the above iterated integrals to the non- 
linear integral? To simplify the presentation, we consider the case d = 1. General 
dimensions can be considered in a similar way with more complex notations. 

We introduce the following notation. Let 



n,a,b 



:= {(^ 1 , a < Si < S2 < - - < Sn < b} 



be a simplex in ^ 



Definition 2 Let (p : An,a,b Mbea continuous function. For a fixed Sn G 
[a,b], we can consider cp(-, Sn) as a function of n — I variables. Assume we can 



define 



^n-l,a 



(p(si, . . . , Sn)W(dsi, •) • • • W(dsn-i, •), which is a function 



of Sn, denoted by then we define 



/ 



(p(Su...,Sn)W(dSu-)---W(ds, 



Cl<Sl<---<Sn<b 



D 

■-/ 



W{dSn,cPn-l{Sn)). (28) 



In the case W(t,x) = f(t)x, such iterated integrals have been studied in [3], where 
an important case is when ^(^-i , . . . , Sn) = p (^-i) for some function p of one variable. 
This means that (p(s\, . . . , Sn) depends only on the first variable. This case appears 
in the remainder term when one expands the solution of a differential equation and 
can be dealt with in the following way. 

Let F\, F 2 , . . . , Fn be jointly Holder continuous functions on [a, b]^. More pre- 
cisely, for each i = 1, ... ,n, Fi satisfies 



IG(‘5’1, ^l) - Fi(S 2 , h) - Fi(s\, t 2 ) + Fi(S 2 , t 2 )\ 

< \\Fi\\r.X-,a,bUl - S2\^\h - t2\^ , for all , ^2, , ?2 in [fl, ^]. 



(29) 



We assume that r -h A. > 1. 

Suppose that F is a function satisfying (29) with r + A. > 1. The nonlinear 
integral F(ds, s) can be defined analogously to Definition 1. Moreover, for a 
Holder continuous function p of order A., we set G(s,t) = p(t)F(s,t), it is easy to 
see that 
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|G(Si, t\) — G{S2, fl) — G(^1, t2) + G{t\, t2)\ 

< \pih) - p{t2)\\F{s\,t\) - F{s 2, h)\ 

+ lpte)ll^’(^i, h) - F(s 2 , h) - F{s\, t 2 ) + Fi{t\,t2)\ 
<(llp|lrll + llplloo)l|F||r,Ak-^2rkl-f2l^ 



Hence, the integration / p{s)F{ds, s) is well defined. In addition, it follows from 
Theorem 2 that the map t p{s)F{ds, s) is Holder continuous of order r. 

We have then easily 

Proposition 6 Let pbea Holder continuous function of order X. Under the condition 
(29) and r > 1 /2, the iterated integral 



• • • 9 ^n) 



is well defined. 



- I 

Cl<Sl<-"<Sn<b 



p(si)Fl{dsi,Sl)F2(dS2, S2)--- F„{dSn,Sn) 

(30) 



In the simplest case when p{s) = 1 and F, (s, t) = f{s) for all / = 1, . . . , n, the 
above integral becomes 



I 

Cl<Sl<-"<Sn<b 



dfisi) ■ ■■dfiSn) 



(fib) - f{aw 

n\ 



Therefore, one would expect that 

\b-a\y^ 

\Ia,b(Fu-..,Fn)\<K^- (31) 

nl 

This estimate turns out to be true for (30). 

Theorem 4 Let Fi, . . . , F^ satisfy (29) and p be Holder continuous with exponent X. 

A- + r — 1 — 1 

We assume that p{a) = 0. Denote f = and = h + 

X P ~ ^ 

X). Then, for any yn < In, there is a constant Cn, independent of a and b (but may 
depend on yn ) such that 



\IaM.Fu . . . , Fn)\ < Cn\b - af" . 



(32) 
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Proof Denote 



jik) 

^a,s 



(Fu...,Fk) 



- / 

Cl<Sl<---<Sk<S 



p{si)Fi{dsi, si)F2{ds2, S2) ■ • ■ Fk(dsk, Sk). 



Thus, we see by definition that 

, Fk+i) = j Fk+iidr, /W(Ti, . . . , Fk)). (33) 

a 

We prove this theorem by induction on n. When n = 1, the theorem follows straight- 
forwardly from (19) with the choice c = a. Indeed, we have \I^)\ < C\t — 

and|/i,V-/i,Vl <C|t-^r- 

The passage from ^ to ^ -h 1 follows from the application of (23)-(33) and this 
concludes the proof of the theorem. 

Remark 4 The estimate of Theorem 4 also holds true for the iterated nonlin- 
ear Young integral ■ ■ ■ , Fn) = Ia<si<-<sk<s p{si))F2(ds2, S2) ■ ■ ■ 

Fn(dSn,Sn), where I^’^liFu . . . , Fk) = Fk(ds, i!/^s~^\Fu . . . , Fk-i)), and 
iaMF,) = f^Fi(ds,p(s)). 
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A Weak Limit Theorem for Numerical 
Approximation of Brownian Semi-stationary 
Processes 



Mark Podolskij and Nopporn Thamrongrat 



Abstract In this paper we present a weak limit theorem for a numerical approxima- 
tion of Brownian semi- stationary processes studied in [14]. In the original work of 
[14] the authors propose to use Fourier transformation to embed a given one dimen- 
sional (Levy) Brownian semi- stationary process into a two-parameter stochastic field. 
For the latter they use a simple iteration procedure and study the strong approxima- 
tion error of the resulting numerical scheme given that the volatility process is fully 
observed. In this work we present the corresponding weak limit theorem for the 
setting, where the volatility/drift process needs to be numerically simulated. In par- 
ticular, weak approximation errors for smooth test functions can be obtained from 
our asymptotic theory. 

Keywords Ambit fields • Brownian semi- stationary processes • Numerical 
schemes • Weak limit theorems 
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1 Introduction 



Recently, the mathematical theory of ambit fields has been intensively studied in 
the literature. Ambit fields is a class of spatio-temporal stochastic processes that has 
been originally introduced by Barndorff-Nielsen and Schmiegel in a series of papers 
[9-11] in the context of turbulence modelling, but which found manifold applications 
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in mathematical finance and biology among other sciences; see e.g. 14, 8]. In full 
generality they are described via the formula 



where t typically denotes time while x gives the position in space. Furthermore, Af{x) 
and Dt{x) are ambit sets, g and q are deterministic weight functions, a represents 
the volatility or intermittency field, <2 is a drift field and L denotes a Levy basis. We 
recall that a Levy basis L = {L{B) : B g ^}, where ^ is a 5-ring of an arbitrary 
non-empty set S such that there exists an increasing sequence of sets (5^) C ^ with 
^neNSn = S, is an independently scattered random measure. 

An important purely temporal subclass of ambit fields are the so called Levy 
(Brownian) semi- stationary processes, which are defined as 



where now L is a two-sided one dimensional Levy (Brownian) motion and the ambit 
sets are given via Af = Df = (—oo, 0- The notion of a semi-stationary process refers 
to the fact that the process (Xt)teR is stationary whenever (at , crt)teR is stationary and 
independent of (Lt)teR An P^st years stochastic analysis, probabilistic properties 

and statistical inference for Levy semi- stationary processes have been studied in 
numerous papers. We refer to 12, 3, 6, 7, 11, 12, 15, 17, 20, 25] for the mathematical 
theory as well as to 15, 26] for a recent survey on theory of ambit fields and their 
applications. 

For practical applications in sciences numerical approximation of Levy (Brown- 
ian) semi- stationary processes, or, more generally, of ambit fields, is an important 
issue. We remark that due to a moving average structure of a Levy semi-stationary 
process (cf. (2)) there exists no simple iterative Euler type approximation scheme. For 
this reason the authors of 113, 14] have proposed two different embedding strategies 
to come up with a numerical simulation. The first idea is based on the embedding 
of a Levy semi- stationary process into a certain two-parameter stochastic partial 
differential equation. The second one is based upon a Fourier method, which again 
interprets a given Levy semi- stationary process as a realization of a two-parameter 
stochastic field. We refer to the PhD thesis of Eyjolfsson 118] for a detailed analysis 
of both methods and their applications to modeling energy markets. We would also 
like to mention a very recent work 116], which investigates numerical simulations of 
spatio-temporal ambit fields. 

The aim of this paper is to study the weak limit theory of the numerical scheme 
associated with the Fourier method proposed in 114, 18]. In the original work 114] 
the authors have discussed the strong approximation error (in the L^ sense) of the 
numerical scheme for Levy semi- stationary processes, where the volatility process 
(cTt)teR is assumed to be observed. We complement their study by analyzing the weak 
limit of the error process in the framework of Brownian semi- stationary processes. 





( 2 ) 
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where the drift and the volatility processes need to be numerically simulated. This 
obviously gives a more precise assessment of the numerical error associated with the 
Fourier method. 

The paper is organised as follows. In Sect. 2 we describe the Fourier approximation 
scheme for Brownian semi- stationary processes and present the main results on strong 
approximation error derived in [14, 18]. Section 3 is devoted to a weak limit theorem 
associated with a slight modification of the Fourier method. 



2 Basic Assumptions and Fourier Approximation Scheme 



We start with a complete filtered probability space (^2, on which all 

processes are defined. We consider a Brownian semi- stationary process of the form 

g(t - s)cTsW(ds) + [ q(t-s)asds, (3) 

oo J —oo 




where g and q are deterministic kernels, (at)teR nnd (crt)teR are adapted cadlag 
processes, and W is a two sided Brownian motion. To guarantee the finiteness of the 
first integral appearing in (3), we assume throughout the paper that 




g^(t — s)a^ds < 00 



almost surely 



( 4 ) 



for all t e R. When (crt)teR is a square integrable stationary process, the above 
condition holds if g G L^(M>o). The presence of the drift process will be 

essentially ignored in this section. 

Now, we describe the Fourier approximation method introduced in [14, 18] 
applied to the framework of Brownian semi-stationary processes. We start with the 
following assumptions on kernels involved in the description (3): 

Assumption (A): 

(i) The kernel functions g and q have bounded support contained in [0, r] for some 
r > 0. 

(ii) g,q € C(M>o). 

In some cases these conditions are rather restrictive. We will give remarks on them 
below. For any given k > 0, we define 



h(x) := g(\x\) and hx(x) := h(x) Qxp(X\x\). 



( 5 ) 
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Notice that g = h on [0, r]. We introduce the Fourier transform of hx via 

hx(y) := / hx(x)oxp(-ixy)dx. 

JR 

Furthermore, if we assume that hx ^ (M), the inverse Fourier transform exists and 

we obtain the identity 



exp(— A.|x|) f ^ 

h(x) = / hx(y)oxp(ixy)dy. 

Since the Fourier transform maps L^(M) functions into the space of continuous 
functions, we require that h e C(M). This fact explains the Assumption (A)(ii) for 
the kernel function g. Since h is an even function, for a given number A G N, we 
deduce an approximation of h via 



with 



h(x) ~ h^{x) := exp(— k|v|) 






bk 



hxiknlt) 

T 



( 6 ) 



( 7 ) 



Obviously, the above approximation is an -projection onto the linear subspace 
generated by orthogonal functions {cos(k7rx/r), sin(^7rv/r)}^Q, hence we deal 
with a classical Fourier expansion of the function h (recall that the function h is even 
by definition, thus the sinus terms do not appear at (6)). Now, the basic idea of the 
numerical approximation method proposed in [14, 18] is based upon the following 
relationship: 



/' 



g(t-s)asW(ds)^ hN(t-s)(XsW(ds) 



i: 

j: 



QXp(—X{t — S)) 



^0 , , ,k7T(t - s) 

~:r+ ybk cos( ) 

2 



N 



bc) ^ kjc 

= -fXk,u(t, 0 ) + — )- 

k=l ^ 



o-,W(di) 

( 8 ) 



where the complex valued stochastic field Xx^uit, y) is defined via 



Xx,u{t^ y) 




X + iy)(t - s)}CTiW(ds) 



( 9 ) 
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and u e [t — t, t].lnsi second step, for a 5 > 0 small, we observe the approximation 

Xx,uit + 8,y) = exp{(-X + iy)8] y) + exp{(-X + iy)(t - W(d5) 

« exp{(-X + iy)8] {Xx,u(t, y) + ^t(W,+s - W,)) . (10) 

Hence, we obtain a simple iterative scheme for simulating the stochastic field 
in the variable t . Assume for the moment that the drift process a is zero and 
we wish to simulate the trajectory of . . . , given the information available 
at time ^o- Then, the numerical simulation procedure is as follows: 

(a) Simulate the independent increments -T'(0, ti — ti-\) for i = 

1,...,M. _ 

(b) For each i = 1, . . . , M and k = 0, . . . , A, simulate kn jx) from 

Xx,uiti-i,k7tlx), Wt- - Wt-_^ and by using (10). 

(c) Simulate Xt- applying steps (a), (b) and (8) (with u = to). 

Let us explain some properties of the proposed numerical scheme. First of all, there 
are two approximation errors, where the first one (N scale) is coming from the Fourier 
transformation at (6) and the second one (M scale) is coming from the discretization 
error obtained at (10). 

It is important to understand the meaning of knowing the information about the 
involved processes up to time ^o- When the stochastic model for the process (crt)teR 
is uncoupled with (Xt)teR^ then we may use u = t — x at (8). Indeed, in typical 
applications such as turbulence and finance this is the case: (crt)teR is usually modeled 
via a jump diffusion process driven by a Levy process, which might be correlated with 
the Brownian motion W. However, when the process (Xt)teR is itself of a diffusion 
type, i.e. 



g(t - s)a(Xs)W(ds) + f q(t- s)a(Xs)ds 
r J t—T 

it is in general impossible to simulate a trajectory of (Xt)teR^ since for each value t 
the knowledge of the path (Xu)ue{t-z,t) is required to compute X^. But, in case we 
do know the historical path, say, (Xu)ue[-z,0]^ the simulation of values Xt, t > 0, 
becomes possible. 

The main advantage of the numerical scheme described above is that it separates 
the simulation of the stochastic ingredients (cr and W) and the approximation of 
the deterministic kernel g (or h). In other words, the stochastic field Xx,u(t,y) is 
simulated via a simple recursive scheme without using the knowledge of g, while 
the kernel g is approximated via the Fourier transform at (6). This is in contrast to a 
straightforward discretization scheme 

r>j 

/ g{t - s)asW{ds) « Y.g{tj - - W,,). 

Jto 
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This numerical property is useful when considering a whole family of kernel func- 
tions (go)ee&^ since for any resulting model Xt(0) only one realization of the sto- 
chastic field y) needs to be simulated. This can be obviously useful for the 

simulation of parametric Brownian semi- stationary processes. 

We may now assess the strong approximation error of the proposed numerical 
scheme. We start with the analysis of the error associated with the approximation of 
the deterministic kernel g by the function hj^. We assume for the moment that the 
volatility process (crt)teR is square integrable with bounded second moment. Then 
a straightforward computation (see e.g. [14, Eq. (4.5)]) implies that 



E 



sf 



[g(t - s) - hN(t - s)]asW(ds) 



)1 



£ C 



1 -exp{-2A(f-fo)) 



' 00 

k=N+l / 

( 11 ) 



where C is a positive constant and the Fourier coefficients bk have been defined 
at (7). We remark that (1 — exp{— 2k(r — ^o)})/k ^ 2(t — to) as k ^ 0, while 
(1 — exp{— 2k(r — ^o)})/^ as k ^ 00 . Thus, it is preferable to choose the 

parameter k > 0 large. 

Remark 1 A standard model for the kernel function g in the context of turbulence 
is given via 

^(x) = exp(— Xv) 



with X > 0 and a > —1/2. Obviously, this function has unbounded support and for 
the values a g (— 1/2, 0) it is also discontinuous at 0, hence it violates the statement 
of the Assumption (A). However, one can easily construct an approximating function 
gj, which coincides with g on the interval [e, T] and satisfies the Assumption (A). 
Assuming again the boundedness of the second moment of the process {cff)teR^ the 
approximation error is controlled via 



E 



(£ 






si (t 









■'L2((0,e)U(r,oo)) 



Such error can be made arbitrary small by choosing s small and T large. Clearly, 
this is a rather general approach, which is not particularly related to a given class 
of kernel functions g. In a second step one would apply the Fourier approxima- 
tion method described above to the function gj. At his stage it is important to note 
that the parameter A > 0 introduced at (5) is naturally restricted through the con- 
dition A < X; otherwise the kernel hx would have an explosive behaviour at oo. 
Thus, the approximation error discussed at (11) cannot be made arbitrarily small 
in A. □ 



Remark 2 The Fourier coefficients bk can be further approximated under stronger 
conditions on the function h, which helps to obtain an explicit bound at (11). More 
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Specifically, when h e (M) and = 0 for all 7 = 1 , . . . , then it holds 

that 

\bk\<Ck-^\ 

This follows by a repeated application of integration by parts formula (see [14, 
Proposition 4.1] for a detailed exposition). In fact, the original work [14] defines 
another type of smooth interpolation functions h, rather than the mere identity h(x) = 
g(|x I), to achieve that the relationship (r) = 0 holds for all 7 = 1 , . . . , ^ and 

some n eN. □ 

Now, let us turn our attention to the discretization error introduced at (10). We assume 
that ^0 < • • • < is an equidistant grid with ti — U-i = At. According to (10) the 
random variable 



j 

njiy) ■= z exp{(-X + iy)U + 1 - i)At}ar,_, {Wt, - (12) 

i=l 

is an approximation of Xx,tQ(tj, y) for any y G M whenever the drift process a is 
assumed to be absent. When (crt)teR is a weak sense stationary process, a straight- 
forward computation proves that 

E[\Xxjo(tj^ y) - rij(y)\'^] < C(tj - to){(X^ + y^)(At f +E[Wr, - (13) 

We refer to [14, Lemma 4.2] for a detailed proof. 

Remark 3 Assume that the process (crt)teR is a continuous stationary Ito semimartin- 
gale, i.e. 

dat =atdt-\- OfdBt, 



where 5 is a Brownian motion and {at)teR^ are stochastic processes with 

bounded second moment. Then the Ito isometry implies that 

E[\a,^-atf]<CAt. 

Hence, in this setting At becomes the dominating term in the approximation error 
(13). □ 

Combining the estimates at (11) and (13), we obtain the strong approximation error 
of the proposed Fourier method, which is the main result of [14] (see Propositions 
4.1 and 4.3 therein). 

Proposition 1 Let to < • • • < tM be an equidistant grid with ti —ti-\ = At. Assume 
that condition (A) holds and (cr^)feM ^ weak sense stationary process. Then the 
approximation error associated with the Fourier type numerical scheme is given via 
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E 



r /^o kn \|2 

/ gitj - s)asW{ds) - (—tjjiO) + ^bktjj { — )) 

L -^*0 ^ ^ ^ . 

V k=N+l 



(14) 



+ (tj - ro) ^ \bk\f{Atf + {-f(^Y.’^\bk\)\Atf 

[ k=i ^ k=i 

k=l / 



for a positive constant C. 



3 A Weak Limit Theorem for the Fourier 
Approximation Scheme 



As we mentioned earlier, the Fourier approximation scheme investigated in [14, 18] 
basically ignored the need of simulating the volatility process (crt)teR in practical 
applications (the same holds for the drift process (at)teR)- As in the previous section 
we fix a time and assume the knowledge of all processes involved up to that 
time. Here we propose a numerical scheme for simulating the path (Xt)te[to,T] for 
a given terminal time T > to, which is a slightly modified version of the original 
Fourier approach. We recall the imposed condition (A), in particular, the weight 
functions g and q are assumed to have bounded support contained in [0, r]. First 
of all, we assume that we have cMlag estimators (a^ , a/^)^e[ro,r] ®f the stochastic 
process (at, crt)te[to,T] and the convergence rate vm ^ oo as M ^ oo such that the 
following functional stable convergence holds: 



vm (a" - a, O'" - cr] A ?7 = (U\ U^) on D^([to, T]), 



(15) 



where the convergence is on the space of bivariate cMlag functions defined on [to, T] 
equipped with the Skorohod topology D^([to, T]). Let us briefly recall the notion of 
stable convergence, which is originally due to Renyi [27]. We say that a sequence of 
random variables with values in a Polish space (E, S’) converges stably in law 
to F, where Y is defined on an extension (Q\ , PO of the original probability 

(f2, P) if and only if 



lim E[/(F'^)Z] = r [/(F)Z] 
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for any bounded and continuous function / and any bounded ^-measurable random 

variable Z. In this case we write (F” Y). In the following we will deal with the 

space of cadlag processes equipped with the Skorohod topology or with the space 
of continuous processes equipped with the uniform topology. We refer to [1, 23] or 
[27] for a detailed study of stable convergence. Note that stable convergence is a 
stronger mode of convergence than weak convergence, but it is weaker than uniform 
convergence in probability. 

We remark that the estimators {af^)te[to,T] might have a different 

effective convergence rate. In this case we will have either U\ = 0 or U 2 = 0. 

Now, we basically follow the Fourier type approach, which refers to (5) and the 
definition of the function hx, described in the previous section, but we replace the 
Fourier transform approximation proposed at (6) by a Riemann sum approximation. 
More specifically, we introduce the approximation 



h(x) 



exp(— k|x|) 



27T 



' hN(x) := 



/ 

m 



hx(y)expiixy)dy 



exp(— k|x|) 
txN 






(16) 



where is a sequence of numbers in N satisfying c^/N ooasA^ ^ 00 . In 
the following we will also assume that the sequence cjq additionally satisfies the 
condition 



N [ \hx(y)\dy 
Jcn/N 



0 as N ^ 00 . 



(17) 



Clearly, such a sequence exists, since hx e (M). When introducing the approxima- 

tion at (16), we obviously obtain two types of error: The Riemann sum approximation 
error and tail approximation error. Condition (17) guarantees that the Riemann sum 
approximation error will dominate. 

Remark 4 Under some stronger conditions the tail integral at (17) can be bounded 
from above explicitly. Assume that h G C^([— r, r]) such that h'{x) = 0 (cf. Remark 
2). Then a repeated application of integration by parts formula implies the identity 

- f IT// 

hx(y) = / hx{x) cos(yx)dx = ^ / h)^(x) cos{yx)dx 

JR y J-T 

for any y > 0. Thus, for any w > 0, we deduce the inequality 

poo poo 

/ \hx{y)\dy<C\\h''\\Li y-^dy <C\\h'^hm-\ 

J U J U 



Hence, condition (17) holds whenever N^/cn 0 sls N 



00 . 



□ 
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Remark 5 We remark that the Fourier transform used at (6) comes from the 
theory. Thus, in contrast to the L^-distance \\h — the limiting behaviour of a 

standardized version of /z (v) — (v) is difficult to study pointwise. This is precisely 

the reason why we use the Riemann sum approximation instead, for which we will 
show the convergence of N{h{x) — hj^{x)). 

If one can freely choose the simulation rates N and M, the Fourier transform of 
(6) is numerically more preferable. According to the estimate (11) and the upper 
bound for the Fourier coefficient of Remark 2 applied for zz = 1, we readily deduce 
the rate N~^ for the L^-error approximation connected to (6). On the other hand, the 
effective sample size of the Riemann approximation at (16) is In the setting of 
the previous remark the overall Riemann approximation error is max(A“^ , N /cj^). 
Recalling that cj^ /N oo, the obtained rate is definitely slower than the one 
associated with Fourier approximation proposed at (6). 

Nevertheless, as our aim is to precisely determine the asymptotics associated with 
the N scale, we will discuss the Riemann approximation approach in the sequel. A 
statement about the Fourier transform (6) will be presented in Remark 8. □ 

Now, we essentially proceed as in the steps (8)-(10). First of all, it holds that 



/ 



[ hN(t -s)(T,W(ds) 

J U 

rt \ ^ 

= / exp(— k(r — 5’)) • cos( 

J u 



k=0 



k(t — s) 
N 



CN 



= Rc'^bkXx.,u(t, —), 



k=0 



(18) 



where the complex valued stochastic field Xx,u{t, y) is defined at (9) and = 
hx{k/N)/{nN). In a second step, for 5 > 0, we obtain the approximation 



Xx,u(t + 5, y) = exp((->, + ( Xx,u(t, y) + 



' exp{(-A + ( Xx,u(t, y) + 






exp((-X + iy)(t - i)Kty(di) 



exp{(— A + iy)(t — 



>)- 

(19) 



When the estimator is assumed to be constant on intervals [si-\,Si), i = 
1, . . . , M, the last integral at (19) can be easily simulated (cf. (10)). We remark 
that this approximation procedure slightly differs from (10) as now we leave the 
exponential term unchanged. 



A Weak Limit Theorem for Numerical Approximation . . . 



Ill 



In summary, given that the information up to time is available, we arrive at the 
simulated value 



:= f 

Jto 



= / h^it — s)a^W(ds) I q(t — s)afds 



/' 



of the random variable 

x! 



JtC) 



- s)cTiVr(d^) + / q{t — s)asds. 

to to 



( 20 ) 



( 21 ) 



Note that the drift part of the Brownian semi- stationary process X is estimated in a 
direct manner, although other methods similar to the treatment of the Brownian part 
are possible. Now, we wish to study the asymptotic theory for the approximation 
error — X^ . Our first result analyzes the limiting behaviour of the function 

N{h{x) -hN{x)). 

Lemma 1 Define the function := N(h{x) —h]^{x)). Let us assume that the 

condition 



holds. Then, under Assumption (A), (17) and (22), it holds that 



^n(x) ^ if {x) = 



_hm 

27T 



exp(— A.|x|) as N ^ OQ 



( 22 ) 



(23) 



for any x G M. Furthermore, it holds that 



sup \ifN(x)\<C 
Nen, xe[0,r] 



for any T > 0. 

Proof First, we recall a well known result from Fourier analysis (see e.g. 
[19, Theorem 8.22]): The condition (22) implies that 

h'^eL\m, VleCiW). (24) 

Now, observe the decomposition 



ifNix) 



Nexp(-k\x\)^ /'(^:+l)/A' 



7T 



+ 



A^exp(-A.|x|) 



^ r{k+\)/N / ^ \ 

X/ -/Cx(-))dy 

V N J 



/ Kx{y)dy 

J{cn+\)IN 



Tt 



^ Hk+\)/N / ^ N 

T/ Lx(j)-/Cx(-))dy + o(l), 

,^^Pk/N \ N J 
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where Kx{y) = hx(y) co^iyx) and the approximation follows by the inequality 
\Kx(y)\ S \hx(y)\ and condition (17). Let us denote by /c^(y) the derivative of 
Kx(y) with respect to y. Since ^ L^(^>o) because of (24), we deduce 

that 



^n(x) 



A^exp(-Alvl)^ 



7t 



^ Kk+l)/N k / k\ 

gi/» 



exp(— A.|x|) 



CN 



Z , k 

k'J—)+o{\) 
^ N 

k=0 



exp(-A|v|) 

/ K^{y)&y as ^ 00 . 

Jo 



In 



But, since hx vanishes at infinity, we readily obtain that 

rOO ^ 

/ K'x^y)^y = -hxiO). 

Jo 

In order to prove the second assertion of the lemma, we observe the inequality 



exot— A.lxl) r(k-\-i)/N roo 

\JrN(x)\ < ^ y / K(a,^ W)|dy + / |^;.(y)|dy, 

where ^k,N(y) is a certain value with ^k,N(y) ^ (k/N,y). Clearly, the second term 
in the above approximation is bounded in N, since it converges to 0. On the other 
hand, we have that I /c^(y) I < |v| |/z;^(y)| + |/i^(y)|, and since GL^(M>o),we 

readily deduce that 

sup \\I/n(x)\<C. 

NeN, x€[0,r] 

This completes the proof of the lemma. □ 

At this stage we need a further condition on the kernel function g to prove tightness 
later. 

Assumption (B): 

(i) The kernel function g has the form 



g(x) = x^f(x) 



for some Ck: > 0 and function / satisfying /(O) 7 ^ 0. 
(ii) / G C^(M>o) has bounded support contained in [ 0 , r]. 
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Notice that the assumption ck: > 0 is in accordance with the condition (A)(ii). Assump- 
tion (B) implies the following approximation result: 



I 



1 



|g(x + S)-^W|^dx < 



a = 0 

C5min(4,4a+1) q, > q 



(25) 



for 8 e [0, T]. The case a = 0 is trivial, while the other one follows along the lines 
of the proof of [20, Lemma 4. 1]. As a matter of fact, we also require a good estimate 
of the left side of (25) when the kernel g is replaced by the function xf/N defined in 
Lemma 1. In the following we will assume that 

sup f \ifN(x + S) - irN(x)fdx < (26) 

NenJo 



for some £ > 0 and 8 e [0, T]. 

Remark 6 Unfortunately, we have not been able to show the statement of (26) under 
the mere assumption of, say, condition (B). Obviously, as in the case of function g, 
condition (26) would hold if 



V^iv(v) = v'"/a^(x), 

where ffq g CH^^>o) with uniformly bounded derivative in A G N and v in a 
compact interval. We can prove condition (26) explicitly when the function g is 
differentiable. Assume that yh^iy), yh'^{y) g L^(M>o) and cj^ is chosen in such a 
way that the condition 



^ / \yhx{y)\^y ^ o as n ^ oo 
Jcn/N 



is satisfied. As in Lemma 1 we conclude that (|9 ;c9>;^x(t) I ^ (\hx(y)\ + \yxhx(y)\ + 
in the proof of Lemma 1, we deduce that 



sup \xlfNM\<c(N r 
xe[ 0 ,r] V Jcn/N 



\yhx(y)\dyyN 




I^Ay)|dy 



+ V / |9y/^x(^LA((y))|dy + V / 

k^^Jk/N 



\8x8yKxi^k,N{y))\dy 



) 



for certain values ^k,N(y), ^k,N(y) in the interval (k/N, y). Then, due to our inte- 
grability conditions, we obtain 



sup \^'n(x)\ < 00 . 

W€N, X€[0,T] 
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Moreover, condition (26) is trivially satisfied due to mean value theorem. However, 
showing (26) under Assumption (B) seems to be a much harder problem for a G 
( 0 , 1 ). □ 

The next result is the main theorem of our paper. 

Theorem 1 Assume that conditions (A), (B), (17), (22) and (26) hold, and the 
processes (crt)te[to,T] (^nd has finite fourth moment with sup^^^^^ E[cr/^] 

< 00 and sup^^j^^^ sup^^p^ E[(cr/^)'^] < oo. We also assume that the process 
Ul^ = vm(ct/^ — cTf) satisfies 

sup sup E[(Uj^)^] < 00 . (27) 

te[to,T]Mef^ 

Then we obtain the decomposition 

^N,M _ ^0 ^ ^N,M ^ 



such that 



NA 



N,M 



hx(0) 

In 



f 

Jto 



exp(-A.(. - s))cfsW{ds) as N,M ^ oo, (28) 



where ucp convergence means that sup^^j^^^ -At\^Q, and 



vmB’^ a B 



= g{-- s)UjW{ds) + q{-~ s)U}ds as M ^ oo, (29) 

J to J to 



where the stable convergence holds on the space C{\tQ, T]) equipped with the uniform 
topology. 

Proof We start with the decomposition = A^'^ + , where 



,M _ 

Jto 

f = fsit- 

Jto 



A"’™ = / {hN(t - s) - g{t - s)}ayW{ds), 



s){(ry - cr^}W(ds) + 



Jto 



s){a^ — as}ds. 



We begin by proving the stable convergence in (29). Let us first recall a classical result 
about weak convergence of semimartingales (see [23, Theorem VI.6.22] or [24]): 

Let (Yf)se[to,T] be a sequence of cadlag processes such that Y on D([to, T]) 

equipped with the Skorohod topology. Then we obtain the weak convergence 



YfW(ds) 



YsW(ds) onC([to,T]) 
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equipped with the uniform topology. This theorem is an easy version of the general 
result, since the integrator W does not depend on n and hence automatically fulfills the 
P-UT property. The stable nature of the aforementioned weak convergence follows by 
joint convergence ( /q Y^W(ds), , W) (/q T,lT(d^), Y, W) (cf. [24]). Hence, 
we deduce that 



[ F;W(ds)A> f F,l^(ck) onC([ro,r]) (30) 

Jo Jo 

equipped with the uniform topology. It is important to note that this result can not 
be directly applied to the process , since this process is not a semimartingale 
in general. Thus, we will prove the stable convergence (29) by showing the stable 
convergence of finite dimensional distributions and tightness. 

We dx u\, . . . , Uk G [to,T]. Due to the condition (15), the finite dimensional 
version of (30) and continuous mapping theorem for stable convergence, we conclude 
the joint stable convergence 



g(wy - s){cr" - cT5}W(ds)|^_^ ^,VMj^qi--s)[a^-as}ds^ 

- ik 



g(uj-s)U^W(ds) 



to 

I- 



as M ^ 00 . Here we remark that the stable convergence for the second compo- 
nent indeed holds, since the mapping F : C([to, z]) x D([to, T]) ^ C([^o, T]), 
F(q, a) = q(- — s)asds is continuous. Hence, we are left with proving tightness 

for the first component of the process . We dx u,t e [to, T] with t > u and 
observe the decomposition 



vm 



g(t - ^){cr" - cTi)lF(d^) - I g(u- s){cr” - cTi}W(ds) 

to Jto ) 



f 

Jto 



M 



(/: 

+ [ - s) - g(u - ^)}{cr" - cr^}W(ds) ) := u) + u). 

Jto 



g{t - s){a^ - CT^}lF(ds) 






,(i)/ 



?(2), 



Using Burkholder and Cauchy-Schwarz inequalities and (27), we have 



E[|<)(f, M)h < C{t - u) f \g(t - ^)|^d^. 

Ju 

Thus, we conclude that 



E[|<)(r,n)|4] <C(r-w)2. 



(32) 
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Now, using the same methods we conclude that 

E[|/?®(f, <C f \g{t -s)- g(u - < C(t - ( 33 ) 

Jto 

where we used the inequality (25). Thus, applying (32), (33) and the Kolmogorov’s 
tightness criteria, we deduce the tightness of the first component of the process . 
This completes the proof of (29). 

Now, we show the pointwise convergence at (28). Recalling the notation from 
(23), we need to show that 

/ [ifNit -s)]a,^W{ds) -^0 asN,M-^oo 

Jto 

for a fixed t. The Ito isometry immediately implies that 



sup E 

MeN 



-s) - \l/(t - 5)}or/^lT(d^) 



1 -/: 



-s)~ fit - 5)rd^ 



0 as ^ oo, 



which follows by Lemma 1 and the dominated convergence theorem. Hence, we 
obtain pointwise convergence at (28). Since the limiting process A is continuous, we 
now need to show that 

sup E[A^^(Af’" - < C{t - m)1+^ 

A^,MeN 

for ^0 < w < L to conclude ucp convergence from pointwise convergence in proba- 
bility. Applying the same methods as in (32), (33) we deduce the inequality 

sup E[Af'^(Af 

iV,MeN 

< C ({t - u) f + f \llfNit - s) - ifN(u - s)\^ds 

\ j U Jto 

< C(f - M)*+^ 

which follows by Lemma 1 and condition (26). This completes the proof of 
Theorem 1. □ 

Remark 7 We remark that the stronger conditions (B) and (26) are not required to 
prove the finite dimensional version of convergence (28) and (29). □ 

Theorem 1 immediately applies to the weak approximation error analysis. Assume for 
simplicity that M = M(N) is chosen such that vm/N 1, so that the Riemann sum 

approximation error and the simulation error from (15) are balanced. We consider a 



A Weak Limit Theorem for Numerical Approximation . . . 



117 



bounded test function cp g (M) with bounded derivative. The mean value theorem 

implies the identity 

’") - <p(X^) = - Z?), 

where random value between Xf and ’ with ^n,m — ^ Xf SiS N ^ oc. 

By properties of stable convergence we deduce that N(xf’^ — Xf)) 

(Xf, At + Bt). Hence, given the existence of the involved expectations, we conclude 
that 

E[^)(zf ’")] - E[^(Z?)] = N-^c'[ip\xf){A, + B,)] + o(Z-‘). (34) 

(Recall that the limit At + Bf is defined on the extended probability space 

Remark 8 The results of Theorem 1 may also apply to the original Fourier approxi- 
mated method proposed in [14, 18]. Let us keep the notation of this section and still 
denote the approximated value of X^ by X^’^ . Recalling the result of (1 1) (see also 
Remark 2) and assuming that M = M(N) is chosen such that ^ 

we readily deduce that 

VMixf'^ -X^)^ B,. ^ 

Remark 9 The results of Theorem 1 might transfer to the case of Levy semi- 
stationary processes 



g(t - s)asL(ds) A- / q(t-s)asds 
00 7—00 

under suitable moment assumptions on the driving Levy motion L (cf. [14]). How- 
ever, when L is e.g. a ^0-stable process with p g (0, 2), it seems to be much harder 
to access the weak limit of the approximation error. □ 

In the following we will present some examples of convergence at (15) to highlight 
the most prominent results. For simplicity we assume that a = 0 in all cases. 

Example 1 Let us consider a continuous diffusion model for the volatility process 
cr, i.e. 

dcr^ = a(cft)dt -h v(cft)dBt, (JtQ = xq, 

where 5 is a Brownian motion possibly correlated with W. We consider an equidistant 
partition to = sq < s\ < • • • < sm = T of the interval [^o, T] and define the 
continuous Euler approximation of at via 




)(^ - ^k) + v{a^){Bt - BsO, t e [su, 
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When the functions a and v are assumed to be globally Lipschitz and continuously 
differentiable, it holds that 



Vm(ct" - O') t/2 onC([ro, 7’]), 



where is the unique solution of the stochastic differential equation 



dU^ = a\at)Ufdt^v\at)UfdBt - ~—vv\cft)dW[, 

V 2 



where W' is a new Brownian motion independent of ^ . We refer to [22, Theorem 
1.2] for a detailed treatment of this result. □ 

Example 2 Let us now consider a discontinuous diffusion model for the volatility 
process a, i.e. 

dat = v{ot-)dLt, = xq, 



where L is a purely discontinuous Levy process. In this framework we study the 
discretized Euler scheme given via 

- Ls,), k = 0,...,M-l. 

We define the process In [21] several classes of Levy 

processes L has been studied. For the sake of exposition we demonstrate the case of a 
symmetric ^0-stable Levy process L with /3 e (0, 2). Let us assume that v e C^(M). 
Then, it holds that 



(M/ log(M))*/^t/" t/2 on D{[to, T]), 

where is the unique solution of the linear equation 

dU^ = E\Gt-)Uf_dLt - EE\Gt-)dL', 

and L' is another symmetric ^0- stable Levy process (with certain scaling para- 
meter) independent of We note that this result does not directly correspond 
to our condition (15) as the discretized process G\tM]/M is used in the definition 
ofU^. □ 
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Non-elliptic SPDEs and Ambit Fields 
Existence of Densities 



Marta Sanz-Sole and Andre SiiB 



Abstract Relying on the method developed in [11], we prove the existence of a 
density for two different examples of random fields indexed by (^, x) g (0, T] xM.^ . 
The first example consists of SPDEs with Lipschitz continuous coefficients driven 
by a Gaussian noise white in time and with a stationary spatial covariance, in the 
setting of [9]. The density exists on the set where the nonlinearity a of the noise does 
not vanish. This complements the results in [20] where cr is assumed to be bounded 
away from zero. The second example is an ambit field with a stochastic integral term 
having as integrator a Levy basis of pure-jump, stable-like type. 

Keywords Stochastic partial differential equations • Stochastic wave equation • 
Ambit fields • Densities 



1 Introduction 



Malliavin calculus has proved to be a powerful tool for the study of questions con- 
cerning the probability laws of random vectors, ranging from its very existence to the 
study of their properties and applications. Malliavin’s probabilistic proof of Horman- 
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der’s hypoellipticity theorem for differential operators in quadratic form provided the 
existence of an infinitely differentiable density with respect to the Lebesgue measure 
on for the law at a fixed time r > 0 of the solution to a stochastic differential 
equation (SDE) on driven by a multi-dimension Brownian motion. The classical 
Malliavin’s criterion for existence and regularity of densities (see, e.g. [13]) requires 
strong regularity of the random vector X under consideration. In fact, X should be 
in the space B^, meaning that it belongs to Sobolev type spaces of any degree. 
As a consequence, many interesting examples are out of the range of the theory, for 
example, SDE with Holder continuous coefficients, and others that will be mentioned 
throughout this introduction. 

Recently, there have been several attempts to develop techniques to prove exis- 
tence of density, under weaker regularity conditions than in the Malliavin’s theory, 
but providing much less information on the properties of the density. The idea is to 
avoid applying integration by parts, and use instead some approximation procedures. 
A pioneer work in this direction is [12], where the random vector X is compared 
with a good approximation X^ whose law is known. The proposal of the random 
vector X^ is inspired by Euler numerical approximations and the comparison is done 
through their respective Eourier transforms. The method is illustrated with several 
one-dimensional examples of stochastic equations, all of them having in common 
that the diffusion coefficient is Holder continuous and the drift term is a measurable 
function: SDEs, including cases of random coefficients, a stochastic heat equation 
with Neumann boundary conditions, and a SDE driven by a Levy process. 

With a similar motivation, and relying also on the idea of approximation, 
A. Debussche and M. Romito prove a useful criterion for the existence of den- 
sity of random vectors, see [1 1]. In comparison with [12], the result is formulated in 
an abstract form, it applies to multidimensional random vectors and provides addi- 
tionally information on the space where the density lives. The precise statement is 
given in Lemma 1 . As an illustration of the method, [11] considers finite dimensional 
functionals of the solutions of the stochastic Navier-Stokes equations in dimension 3, 
and in [10] SDEs driven by stable-like Levy processes with Holder continuous coef- 
ficients. A similar methodology has been applied in [1, 2, 4]. The more recent work 
[3] applies interpolation arguments on Orlicz spaces to obtain absolute continuity 
results of finite measures. Variants of the criteria provide different types of proper- 
ties of the density. The results are illustrated by diffusion processes with log-Holder 
coefficients and piecewise deterministic Markov processes. 

Some of the methods developed in the references mentioned so far are also well- 
suited to the analysis of stochastic partial differential equations (SPDEs) defined by 
non- smooth differential operators. Indeed, consider a class of SPDEs defined by 

Lu(t,x) =b{u(t,x)) -\- a(u{t,x))F{t,x), (t, x) e (0, T] x , (1) 

with constant initial conditions, where L denotes a linear differential operator, 
cr, Z?:IR ^ M, and F is a Gaussian noise, white in time with some spatial corre- 
lation (see Sect. 2 for the description of F). Under some set of assumptions, [20, 
Theorem 2.1] establishes the existence of density for the random field solution of (1) 
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at any point {t,x) G (0, T] x and also that the density belongs to some Besov 
space. The theorem applies for example to the stochastic wave equation in any spatial 
dimensions d 

The purpose of this paper is to further illustrate the range of applications of 
Lemma 1 with two more examples. The first one is presented in the next Sect. 2 and 
complements the results of [20] . In comparison with this reference, here we are able 
to remove the strong ellipticity property on the function a, which is crucial in most of 
the applications of Malliavin calculus to SPDEs (see [19]), but the class of differential 
operators L is more restrictive. Nevertheless, Theorem 1 below applies for example 
to the stochastic heat equation in any spatial dimension and to the stochastic wave 
equation with d < 3. For the latter example, if a, Z? are smooth functions and a is 
bounded away from zero, existence and regularity of the density of u(t,x) has been 
established in [16, 17]. 

The second example, developed in Sect. 3, refers to ambit fields driven by a class 
of Levy bases (see 14). Originally introduced in [5] in the context of modeling 
turbulence, ambit fields are stochastic processes indexed by time and space that are 
becoming popular and useful for the applications in mathematical finance among 
others. The expression (14) has some similarities with the mild formulation of (1) 
(see 3) and can also be seen as an infinite dimensional extension of SDEs driven by 
Levy processes. We are not aware of previous results on densities of ambit fields. 

We end this introduction by quoting the definition of the Besov spaces relevant 
for this article as well as the existence of density criterion by [11]. 

The spaces > 0, can be defined as follows. Let f:R^ M. Forx, h 

set (2\^/)(v) = f{x h) — f(x). Then, for any ^ g N, ^ > 2, let 

(Alf)(x) = {Al(Al-^f))(x) = ^(-ir-j(^\f(x + jh). 

j=o 

For any 0 < 5’ < ^, we define the norm 

ii/b5^ = ii/iiLi + sup [hrwAifWi^u 
\h\<l 

It can be proved that for two distinct n,n' > s the norms obtained using n or n' are 
equivalent. Then we define B\ ^ to be the set of -functions with H/H^j ^ < oo. 
We refer the reader to [22] for more details. 

In the following, we denote by the set of bounded Holder continuous functions 
of degree a. The next Lemma establishes the criterion on existence of densities that 
we will apply in our examples. 
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Lemma 1 Let k be a finite nonnegative measure. Assume that there exist 0 < a < 
a < 1, n e N and a constant Cn such that for all 0 G and all h e with 

\h\ < 1 , 



/ 

JR 









( 2 ) 



Then k has a density with respect to the Lebesgue measure, and this density belongs 
to the Besov space (M). 



2 Nonelliptic Diffusion Coefficients 



In this section we deal with SPDEs without the classical ellipticity assumption on the 
coefficient a, i.e. inf^^j^^/ \cr(x)\ > c > 0. In the different context of SDEs driven 
by a Levy process, this situation was considered in [10, Theorem 1.1], assuming in 
addition that a is bounded. Here, we will deal with SPDEs in the setting of [9] with 
not necessarily bounded coefficients cr . Therefore, the results will apply in particular 
to Anderson’s type SPDEs (a(v) = kx, k / 0). 

We consider the class of SPDEs defined by (1), with constant initial conditions, 
where L denotes a linear differential operator, and cr, Z?:R ^ R. In the definition 
above, E is a Gaussian noise, white in time with some spatial correlation. 

Consider the space of Schwartz functions on R^, denoted by ^(R*^), endowed 
with the following inner product 

(0,0)^:= [ [ r(Jx)0(y)0(y -x), 

jRd J^d 

where T is a nonnegative and nonnegative definite tempered measure. Using the 
Eourier transform we can rewrite this inner product as 



(0, 0)^ = 




/x(J§)^0(§)^0(§), 



where /x is a nonnegative definite tempered measure with ^ pi = F . Let = 

{y, (•, •) and Mr'- = L^([0, T]; ^). It can be proved that F is an iso- 
normal Wiener process on Mr . 

Let A denote the fundamental solution io Lu = 0 and assume that A is either a 
function or a non-negative measure of the form A(t, dy)dt such that 



sup A(t, R'^) < Cr < 00 . 
?€[0,r] 
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We consider 



u(t, x) 




A(t — s, X — y)a(u(s, y))M{ds, dy) 




A(t — s, X — y)b(u(s, y))dydy, 



(3) 



as the integral formulation of (1), where M is the martingale measure generated by 
F. In order for the stochastic integral in the previous equation to be well-defined, we 
need to assume that 



f ds [ < + 00 . (4) 

Jo Jr'I 

According to [9, Theorem 13] (see also [23]), equation (3) has a unique random field 
solution {u(t, x); (t, x) e [0, T] X which has a spatially stationary law (this is 
a consequence of the 5-property in [9]), and for all /? > 2 

sup E[|w(Ax)|^] < 00 . 

{t,x)e[0,T]xRd 



We will prove the following result on the existence of a density. 

Theorem 1 Fix T > 0. Assume that for all t G [0, T\ A{t) is a function or a non- 
negative distribution such that (4) holds and sup^^|^Q A{t, < oo. Assume fur- 
thermore that a and b are Lipschitz continuous functions. Moreover, we assume that 

ct^ < [ ds [ iJi(d^)\^ A(s)(^)\^ < 

Jo Jw 

[ ds\^A{s)(Q)\^ <Ct^, 

Jo 

for some y, yi,Y 2 > 0 and positive constants c and C. Suppose also that there exists 
8 > 0 such that 

E[|M(f,0)-M(s,0)|2] <C|^-^|^ (5) 

for any s,t e [0, T] and some constant C > 0, and that 

min{yi, y2}+'5 

y\ = > 1. 

K 

Fix {t, x) G (0, r] X Then, the probability law of u(t, x) has a density f on 
the set {y G M; o(y) 0}. In addition, there exists n > \ such that the function 

y |cr(y)|”/(y) belongs to the Besov space with ^0 G (0, y — 1). 
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Proof The existence, uniqueness and stationarity of the solution u is guaranteed by 
[9, Theorem 13]. We will apply Lemma 1 to the law of u(t , x) dX x = 0. Since 
the solution u is stationary in space, this is enough for our purposes. Consider the 
measure 

K{dy) = |cr(y)r [P o m(/, 0)“^] {dy). 

We define the following approximation of Let for 0 < £ < r 

(t,0) = (t,0) o-iuit — £,0)) [ f A(t — s, —y)M(ds, dy), (6) 

Jt-8 JRd 

where 



U^(t,0)= [ [ A{t — s, —y)cf{u{s,y))M{ds,dy) 

Jo Jr^ 

+ / [ A(t — s, —y)b(u{s, y))dyds 

Jo JRd 

b(u(t — £,0)) f f A(t — s, —y)dyds. 

J t—e Jr^ 



Applying the triangular inequality, we have 



f ^h^{y)K{dy) 
Jm. 



= |E[|a(M(r,0))|M^.^(M(r,0))]| 

< |E[(|or(M(f, 0))|” - \a{u{t - e, 0))]| 

+ |E[|a(M(f - e, O))|”(Z\^0(M(r, 0)) - 0)))]| 

+ |E[|or(M(f - e, 0))rAl<t>(uHt, 0))]|. (7) 



Remember that 1 1 A ^ 0 1 1 < 1 1 0 1 1 • Consequently, 

\Ai<p(x)\ = <c„_iii0ii^»i/*r, 



Using this fact, the first term on the right-hand side of the inequality in (7) can be 
bounded as follows: 



|E[(|a(M(f, 0»r - W(u(t - s, 0))nAl4>(u(t, 0))]| 

< Cn\m<ff^^\hrE[\\(T(u(t,0))\" - |a(M(? -e,0))r|]. (8) 

Apply the equality x" — y" = (x — y)(x"“* + x"“^y H h xy"~^ + y"“^) along 

with the Lipschitz continuity of cr and Holder’s inequality, to obtain 
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E[||ff(M(f, 0))|" - \a(u(t-s,0))f\] 

[ n — 1 - 

\a(u(t,0)) - a(u(t - s, 0))| ^ 
j=0 

< c(^E[|M(f, 0) - u(t - s, ^ \a(u(t, 0))\j\a(u(t, ^ 

< C„£^/2, (9) 

where we have used that a has linear growth, also that u(t,0) has finite moments of 
any order and (5). Thus, 

|E[(ior(M(f,o))r - w(u{t - s,o))nAi<t>{u(t,o))] \ < c„m^a\hrs^'\ (lo) 
With similar arguments, 

|E[|or(M(f - e, 0))r(Al(l,(u(l, 0)) - Al<P(u^t, 0)))]| 

< C„||0|k«E[|or(M(f - e, OWluit, 0) - M*(f, 0)|“] 

< C„||0|k«(E[|M(f,O) -M^(r,0)|2])“/2(E[|or(M(f -e,0))|2«/(2-«)])i-“/2 
< C„||0||^c,e^“/2(^i(e) + g2(e))“/2, (11) 

where in the last inequality we have used the upper bound stated in [20, Lemma 2.5] . 
It is very easy to adapt the proof of this lemma to the context of this section. Note 
that the constant Cn in the previous equation does not depend on a because 

(E[|a(M(f - e, o))| 2«/(2-«)])1-“/2 < (E[(|a(M(r - e, 0))| V i)2«])i-“/2 

< E[|o-(m( 1 -e,0))|2" V 1]. 

Now we focus on the third term on the right-hand side of the inequality in (7). Let 
Pe denote the density of the zero mean Gaussian random variable A(t — 

s, —y)M(ds, dy), which is independent of the a-field ^t-s and has variance 

g{s): = f ds [ > Cs^. 

Jo JW 

In the decomposition (6), the random variable U^(t, 0) is -measurable. Then, 
by conditioning with respect to ^t-s and using a change of variables, we obtain 
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|E [|ct(m(? - e, 0))] I 

= |E[E[l(,(„(,_,,o))^o)k(M(r-e,O))|M^0(M^(f,O))|#-,_,]]| 

^{a(u(t-s,0))m I k(M(f - e, 0))\"Al<p{U^ + a{u{t - e, Q))y)pg{y)dy 

l{cr(w(r— e,0))7^0} / 0)) I 

JM 

X 0(t/f + 0))}^)/^" 

< II 0 IIOOE l{cr(„(/-£,0))5^0)|o-(M(f - 



On the set {a(u(t — s, 0)) 7 ^ 0}, the integral in the last term can be bounded 
as follows, 



\^-a(u(t-s,0))-^hP>^^y^\dy ^ Cn\cr(u(t - e, 0 ))r"|/zn|pf||i,(; 

< Cn\a(u{t - s,0))\-"\hrg(s)-'^'\ 



where we have used the property || 2 ^^/|Il 1 (M) ^ IIl 1 (M)’ 

\\pi"^\\Li = te(e))“"/^ < C„e-">^/2 (seee.g. [20, Lemma 2.3]). 

Substituting this into the previous inequality yields 

\E[\a{u(t-e,0WAl<t>(uHt,0))]\<C„m<^.\hre-’'y/\ (12) 

because ||0||oo < 110 lk“- 
With (7), (10)-(12), we have 

[ Al4>{y)K{dy) 

7 r 

< C„||0||^.(|/ire^/2 + e^“/2(gi(e) +^2(e))“^^ + \h\'' e~''y 1^) 

< C„||0||^a(|/!re^/2 ^ g(5+n)«/2 ^ g(5+K2)«/2 ^ |/,|«e-nK/2^ 

< C„||0||^a(|/z|“e^/2 ^ gKK«/2 ^ |/,|«£-«K/2^ (13) 

Lets = with p = 2n/(]/^ + yy o'). With this choice, the last term in (13) 

is equal to 

/ Q, I n8 nya \ 

Cn\W\^^[\h\ ^y^n+ya) ^\k\n+Y0^y 
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Since yi < y, by the definition of }/, we obtain 

_ min{]/i, }/2} S S 

y - 1 = ^ 1 < -. 

r y r 

Fix ^ G (0, ]7 — 1). We can choose n e N sufficiently large and a sufficiently close 
to 1 , such that 



n8 nya 

a H z — > ^ + O' and ^ > ^ + o'. 

y{n-\-ya) n ya 

This finishes the proof of the theorem. 

Remark 1 (i) Assume that a is bounded from above but not necessary bounded 
away from zero. Following the lines of the proof of Theorem 1 we can also 
show the existence of a density without assuming the existence of moments 
of u(t, x) of order higher than 2. This applies in particular to SPDEs whose 
fundamental solutions are general distributions as treated in [8], extending the 
result on absolute continuity given in [20, Theorem 2.1]. 

(ii) Unlike [20, Theorem 2.1], the conclusion on the space to which the density 
belongs is less precise. We do not know whether the order y — 1 is optimal. 



3 Ambit Random Fields 



In this section we prove the absolute continuity of the law of a random variable 
generated by an ambit field at a fixed point (t,x) e [0, T] x The methodology 
we use is very much inspired by [10]. Ambit fields where introduced in [5] with 
the aim of studying turbulence flows, see also the survey papers [6, 15]. They are 
stochastic processes indexed by (t,x) e [0, T] x of the form 



X(t, a) = AO + 




g(t,s;x,y)cT(s, 



y)L{ds, dy) 




hit, s\ X, y)b(s, y)dyds. 



(14) 

where vq g M, g, /z are deterministic functions subject to some integrability and 
regularity conditions, cr,h are stochastic processes, At(x), Bt(x) c [0, r] x 
are measurable sets, which are called ambit sets. The stochastic process L is a Levy 
basis on the Borel sets ^([0, T] x M^). More precisely, for any B e ^([0, 
the random variable L(B) has an infinitely divisible distribution; given B\, . . . , B^ 
disjoint sets of B e =^([0, T] x M^), the random variables LiB\), . . . , L{Bk) are 
independent; and for any sequence of disjoint sets C ^([0, T] x W^), 



oo 

L(U^iAj) = ^^L(Aj), P-almost surely. 



130 



M. Sanz-Sole and A. SiiB 



Throughout the section, we will consider the natural filtration generated by L, i.e. 
for all t G [0, r], 



= <y{L{A)\ A e [0, r] x X(A) < oo). 

For deterministic integrands, the stochastic integral in (14) is defined as in [18]. 
In the more general setting of (14), one can use the theory developed in [7]. We refer 
the reader to these references for the specific required hypotheses on g and a . 

The class of Levy bases considered in this section are described by infinite divisible 
distributions of pure-jump, stable-like type. More explicitly, as in [18, Proposition 
2.4], we assume that for any B g ^([0, T] x M^), 

logE[exp(i^L(5))] = [ X{ds,dy) [ p^,^(Jz)(exp(i^z - 1 - i^zl[_l l](z))), 

J[0,T]xRd JR 

where A. is termed the control measure on the state space and (Ps,y){s,y)e[0,T]xRd 
a family of Levy measures satisfying 

/ min{l, z^}ps y(dz) = 1, A. - a.s. 

JR 

Throughout this section, we will consider the following set of assumptions on 

(Ps,y){s,y)e[0,T]xRd and on A.. 

Assumptions 1 Fix (t,x) g [0, T] x and a g (0, 2), and for any <2 > 0 let 
= (—(J, (J)- Then, 

(i) for all G [0, O') there exists a nonnegative function Cp G (A.) such that for 
all (2 > 0, 



/ \zf Ps.yidz) <Cp{s,y)a^ A-a.s.; 

JiffnY 



'{ffa)- 

(ii) there exists a non-negative function C e L* (A) such that for all a > 0, 

I \z\^Ps,yidz) < C{s, y)a^~“, X - a.s.; 

(iii) there exists a nonnegative function c G L^(X) and r > 0 such that for all § G 
with 1^1 > r. 



(1 - co^{^z))ps,y{dz) > c(s, y)|§r, A. - a.s. 
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Example 1 Let 

Ps,y(dz) = ci(s, 

with (s,y) G [0, T] x and assume that c\,c-i e L^(X). This corresponds to 
stable distributions (see [18, Lemma 3.7]). One can check that Assumptions 1 are 
satisfied with C = C = c\ V c_i, and c = ci A c-\. 

Assumptions 2 (HI) We assume that the deterministic functions g, h:{0 < s < t < 
T} xR^ xR^ ^ R and the stochastic processes (cr(s, y); (s, y) e [0, T] x R^), 
(b(s, y); (s, y) e [0, T] xM^) are such that the integrals on the right-hand side of (14) 
are well-defined (see the conditions in [18, Theorem 2.7] and [7, Theorem 4.1]). We 
also suppose that for any G g [ 2 , oo) wehave sup^^|^Q E[|a( 5 ', y)\^] < oo. 

(H2) Let a be as in Assumptions 1. There exist 8 \, 82 > 0 such that for some 
y G (a, 2] and, if Ck: > 1, for all ^ G [1, a), or for ^ = 1, if ck: < 1, 

E[|a(f, X) - a(^, y)\y] < Cy(\t - + |x - ( 15 ) 

E[\b(t, X) - b{s, y)f] < CfiQt - + |x - (16) 

for every (/, x), (^, y) e [0, T] x and some Cy,Cp > 0. 

(H3) |cr(f,x)| > 0, <w-a.s. 

(H4) Let a, C, Cp and c as in Assumptions 1 and 0 < e < r. We suppose that 

/ [ ^A,(x)is,y)cis,y)\g(t,s,x,y)\'"X(ds,dy) < 00, (17) 

Jt-s 

csy° < [ [ lA,(x)is,y)c{s,y)\g{t,s,x,y)fX{ds,dy)<oo, (18) 

Jt-s Jr^ 



where in (17), c(s, y) = C(s, y) V Co(*s’, y), and (18) holds for some yo > 0. 
Moreover, there exist constants C, y\, y 2 > 0 and y > a such that 



[ [ ^A,(x)(s,y)Cij(s,y)\g(t,s,x,y)\y\t - s - sf^yx(ds,dy) < Csyy\ 

Jt-e Jr<^ 

a 

[ [ ^A,(x)(s,y}Cp(s,y)\g{t,s,x,y)\y\x - yf^yX{ds,dy} < Cs^y^. 

Jt-s JR‘‘ 



( 20 ) 



We also assume that there exist constants C, 73 , 3/4 > 0 such that for all ^0 g [1, ck^), 
if Ckf > 1 , or for ^ = 1 , if ckf < 1 , 
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- e - sf^f^dyds (21) 



/ / \B,(x){s,y)\h{t,s,x, 

Jt-s JR‘> 



y)\^\x - yf^f’dyds < (22) 



where is defined as in Lemma 3. 

(H5) The set At(x) “reaches i.e. there is no £ > 0 satisfying At(x) ^ [0,t — 



Remark 2 (i) By the conditions in (H4), the stochastic integral in (14) with respect 
to the Levy basis is well-defined as a random variable in L^(^2) for any ^ g 
(0, a) (see Lemma 3). 

(ii) One can easily derive some sufficient conditions for the assumptions in (H4). 
Indeed, suppose that 



then (19) holds with y\ = yi + 5i. If in addition, Af{x) consists of points 
{s, y) G [0, r] X such that |v — y| < |/^ — for any s e [t — s, t], and 
for some ^ > 0, then (20) holds with Y2 = Yi + ^2^ - Similarly, one can derive 
sufficient conditions for (21), (22). 

(iii) The assumption (H5) is used in the proof of Theorem 2, where the law of 
X(t,x) is compared with that of an approximation X^(t, x), which is infinitely 
divisible. This distribution is well-defined only if At(x) is non-empty in the 
region [t — s,t] x 

(iv) Possibly, for particular examples of ambit sets At(x), functions g,h, and sto- 
chastic processes a, b, the Assumptions 2 can be relaxed. However, we prefer 
to keep this formulation. 

We can now state the main theorem of this section. 

Theorem 2 We suppose that the Assumptions 1 and 2 are satisfied and that 



Fix {t,x) G (0, T] X Then the law of the random variable X{t,x) defined by 
(14) is absolutely continuous with respect to the Lebesgue measure. 



3.1 Two Auxiliary Results 



s] X W^. 




min{}/i, 72, K3, K4} 1 



(23) 



> 



a 



In this subsection we derive two auxiliary lemmas. They play a similar role as those 
in [10, Sects. 5.1 and 5.2], but our formulation is more general. 
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Lemma 2 Let p = (Ps,y)(s,y)e[ 0 ,T]xRd be a family of Levy measures and let X be a 
control measure. Suppose that Assumption \(ii) holds. Then for all y G (a, 2) and 
all a G (0, oo) 



[ \z\^ Ps,y(dz) < Cy^aC{s,y)a'‘' X-a.s. 

J\z\<a 



where Cy^a = 2 2y^~°'-i ' lienee 



[ [ [ \z\^ Ps,y{dz)X{ds,dy) < Ca'^~ 

J 0 Jr^ ^\z\<a 



Proof The result is obtained by the following computations: 



« oo 

/ W Ps,y{dz) = V 

•^Iz|<a „=n 



_^J{a2 ^ ^<\z\<a2 



\zV Ps,y{dz) 



OO „ 

n=0 



{\z\<a2-"] 



\zfPs,y(dz) 



< C{s, y) 

n=0 

< Cy-aC(s, y)a'^~“. 



—n\2—a 



□ 



The next lemma provides important bounds on the moments of the stochastic 
integrals. It plays the role of [10, Lemma 5.2] in the setting of this article. 

Lemma 3 Assume that L is a Levy basis with characteristic exponent satisfying 
Assumptions \ for some a G (0, 2). Let H = {H(t, -^))(^,jc)e[0,r]xM'^ be a predictable 
process. Then for all0<f<a<y <2 and for all 0 < s < t < s 



n ^A,ix)(r, y)g{t,r,x, y)H{r, y)L{dr, dy) 

Id 

S Ca,p,y\t 

■(/X ^A,(x){r, y)Cp{r, r, x, j)|>'E[|//(m, y)\>''\X{dr, dy)^ 

(24) 



where C^{r, y) is the maximum ofC{r, y), and (Cp + Ci)(r, y) (see Assumptions 1 
for the definitions). 
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Proof There exists a Poisson random measure N such that for all A g 

L([5’, /^] X A) = 



m zN{dr,dy,dz) A [ [ [ zN(dr,dy,dz) 
,:\<1 Js JaJ\z\>1 



(see e.g. [14, Theorem 4.6]), where N stands for the compensated Poisson random 
measure N(ds, dy, dz) = N(ds, dy, dz) — ps,y{dz)'k{ds, dy). Then we can write 



E 

with 

rl 



Js Jw 



y)g{t, r, X, y)H{r, y)L(dr, dy) 



(25) 






lid = E 



/?,: = E 



n ^A,{x){r,y) [ zg{t,r,x,y)H{r,y)N{dr,dy,dz) 

n iA,(x)(r,y) [ zg(t,r,x,y)H(r,y)N(dr,dy,dz) 

n ^A,{x){r,y) [ zg(t,r,x,y)H{r,y)N(dr,dy,dz) 

J\z\>l 



To give an upper bound for the first term, we apply first Burkholder’s inequality, then 
the subadditivity of the function v (since the integral is actually a sum), 

Jensen’s inequality, the isometry of Poisson random measures and Lemma 2. We 
obtain. 



<C^e[ f [ lA,(x)(r,y) [ 

L Js Jm^ J\z\ 



' \z\<{t—sp/^ 

X \z\^\g(t, r, X, y)\^\H{r, y)\^N{dr, dy, dz) 



P!2- 



<QE 



r f U,(x)(r,y) I 
Js Jr*' J\z\ 



X \zV\g(t, r,x, y)Y\H(r, y)^ N {dr, dy , dz) 






mr 



X \zV\g{t, r,x, y)\>'\H{r, y)\>'N{dr, dy, 



PlY 



= CAA f [ lA,(x)(r,y)( [ \zVpr,y{dz^ 

\ Us JmJ \ j | z |<(; -«)'/“ / 
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X \g(t,r,x,y)\'>'\H{r,y)\>'X(dr,dy) 

■ Ui ^A,(x){r, y)C(r, )')|g(r, r, x, j)|’'E[|7/(m, y)\y]X(dr, 



/i/y 



Notice that the exponent (y — a) /a is positive. 

With similar arguments but applying now Assumption l(i), the second term in 
(25) is bounded by 



Kr < CpE 



[ [ ^A,(x)(r,y) [ kl 



X \g{l, r, X, y)\^\H{r, y)\^N{dr, dy, dz) 



cA f j lA,ix)ir,y) [ 
_Js J (t- 



fi/2- 






X \g{l, r, X, y)f\H{r, y)fN{dr, dy, dz) 
I lA,(x)(r,y)i , 

I (t— 5)l/“<|z|; 

X \g{t,r,x,y)f\H{r,y)\^X(dr,dy) 
r rt 



= CfiE [ [ lA,(x){r,y)( [ \z\^ Pr,y(dz)\ 



[ [ lA,(x)ir.y)Cp(r,y) 

.Js Jw 

X \g{t, r, X, y)f[\H(r, y)fX(dr, dy) 

< [ [ lA,(x)ir,y)Cp(r,y) 

\Js JRd 

\P/y 

X \g{t,r,x,y)\yE[\H{r,y)\y]X{dr,dy)j 



where in the last step we have used Holder’s inequality with respect to the finite 
measure C^(r, y)k(dr, dy). 

Finally, we bound the third term in (25). Suppose first that < 1. Using the 
subadditivity of v and Lemma 2(i) yields 



ilt < CpE 
<C^E 




^A,(x){r,y) [ \z\^\g{t,r,x, 

J\z\>\ 

iA,(x)(r,y)( [ \zfpr,y{dz) 
\J\z\>\ 



y)f\H{r, y)fN{dr, dy,dz) 

) 
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X \g{t,r,x,y)f\H{r,y)fX{dr,dy) 



<C/iE 



^A,(x){r, y)Cp{r, y)\g{t, r, x, y)f\H{r, y)\^X(dr, dy)^ 



(//.. ^A,(x)(r, y)C^{r, )’)|g(r, r, x, y)\^E[\H{r, y)\^]X(dr, dy)^ 

where in the last step we have used Holder’s inequality with respect to the finite 
measure C^(r, y)k(dr, dy). 

Suppose now that > 1 (which implies that Ck: > 1). We apply Holder’s inequality 
with respect to the finite measure Ci (r, y)k{dr, dy) and Assumption l(i) 



lit < 2^“‘E 



[ [ ^A,(x){r,y) [ zg{t,r,x,y)H{r,y)N{dr,dy,dz) 

Js ^|z|>l 

[ f ^A,ix)(r,y)( [ \z\Pr,y{dz)) 

Js JmJ w|z|>1 / 



+ 2^“^E 
X g(t, r, X, y)H(r, y)X(dr, dy) 






f [ iA,(x)(r,y) f \z\^\g(t,r,x,y)\ 

Js JmJ J \z\>l 



kl 

)0/2n 



X |if(r,y)pA(Jr, Jy, Jz) 



+ C^E 
<C^E 

+ Cfi 



. y)C\{r, y)\g{t, r, x, y)\H{r, y)\X{dr, dy) 



f f lA,w(r,. 

Js Jr'I 

^A,(x){r,y) [ \z\^\g(t,r,x,y)\^\H{r,y)fN{dr,dy,dz) 

J\z\>l 

(IL Ci(r, y)k(dr, 

f [ iA,(x)ir,y)Ci{r,y)\g{t,r,x,y)fE[\H{r,y)\^]X(dr,dy) 

J s J'EJ 



<CpE / / lA,ix)(.r,y){Ci{r,y) + Ci}(r,y)) 

.Js JR‘> 

X \g{t,r,x,y)f\H(u,y)\^X{dr,dy) 

<cJ f [ lA,(x)ir, y){Ci{r, y) + Cp{r, y)) 

\Js JR‘> 

\filY 

X \g{t,r,x,y)\^E[\H{u,y)\^]X{dr,dy)\ 
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where in the last step we have used Holder’s inequality with respect to the finite 
measure (C\(r, y) + C^(r, y))k(dr, dy). We are assuming 0 < ^ — 5' < 1, and 
0 < < O'. Hence, the estimates on the terms ^ / = 1, 2, 3 imply (24) 

3.2 Existence of Density 

With the help of the two lemmas in the previous subsection, we can now give the 
proof of Theorem 2. Fix (t,x) e (0, T] x and let 0 < £ < r to be determined 
later. We define an approximation of the ambit field X(t,x)hy 

X^(t,x) = U^(t,x)-\-a(t-£,x) [ [ lA,{x)(s,y)g(t,s;x,y)L(ds,dy), (26) 

Jt-s JRd 



where 



U^(t,x)=xo+[ [ lA,(x)is,y)8(t,s;x,y)(T(s,y)L(ds,dy) 

Jo 

+ / / lB,{x){s,y)h{t,s;x,y)b(s,y)dyds 

Jo JR'I 



/ lB,{x)(s,y)h{t,s;x,y)dyds 
R‘‘ 



Note that U^(t, x) is -measurable. 

The stochastic integral in (26) is well defined in the sense of [18] and is a random 
variable having an infinitely divisible distribution. Moreover, the real part of its 
characteristic exponent is given by 



-h b{t — 6, x) 



L 



where 



9f(logE[exp(/§A)]) = [ (1 - cos(§z))p/(Jz), 
JR 




i^z)k{ds , dy). 



In the setting of this section, the next lemma plays a similar role as [20, Lemma 
2.3]. It generalizes [10, Lemma 3.3] to the case of Levy bases as integrators. 

Lemma 4 The Assumptions 1, along with (17) and (18) hold. Then, the random 
variable ^ 

/ / ^At{x){s,y)g{t,s,x,y)L(ds,dy) 

Jt-s JRd 



has a -density pt,x,s, and for < 2 // ^ G N there exists a finite constant > 0 such 

that ||Pr7,£llL‘(R) - Cn,t,x{s'*'^ ^ 1)“"/". 
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Proof We follow the proof of [10, Lemma 3.3], which builds on the methods of [21]. 
First we show that for | sufficiently large, and every t G (0, L], 

c,,x,e\^r (27) 

Indeed, let r be as in Assumption l(iii). Then, for | > r, we have 

[ {I -cos{^ z))pf(dz) 

M 

= / / k(ds,dy) {1 -cos{^zlA,(x)(s,y)g(t,s,x,y)))ps,y(dz) 

Jt—e 

> l^r / [ iA,(x)is,y)\g{t,s,x,y)fcis,y)X{ds,dy) 

Jt-s JR'I 

>c,,s,xS^^\^r- (28) 

This proves the lower bound in (27) for | > r. 

In order to prove the upper bound in (27), we set 

a^,t,s,x,y- = III 1/1, (;>:)(.?, 

and use the inequality (1 — cos(x)) < 2(x^ A 1) to obtain 

m>Px(^)= [ [ k(ds,dy) [ { 1 -cos(z^lA,ix)(s,y)g(t,s,x,y)))ps,y(dz) 

Jt-e Jw^ 7 r 

<2 f I X{ds,dy) [ {\z\^\^\^\At(x){s,y)\g{t,s,x,y)\^ A\)ps,y{dz) 
Jt-s 7 r 

= 2 f f k(ds,dy) [ \z\^\H?"^A,{x)(s,y)\g(t,s,x,y)\^Ps,y{dz) 

Jt-eJvd 

: [ [ k(ds,dy) [ Ps,y(dz). 



+ 2 



(29) 



Then, using Assumption l(ii), the first integral in the right-hand side of the last 
equality in (29) can be bounded as follows: 



f [ k(ds,dz)\^\^lA,(x)(s,y)\g{t,s,x,y)\^ 

Jt-s Jr'I 

<111“/' / lA,ix}(s,y)\g(t,s,x,y)rC(s,y)k(ds,dy) 

J t—s 

<C|||“, 



if kl^P5,y(^i?2) I 



where in the last inequality, we have used (17). 
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Consider now the last integral in (29). By applying Assumption l(i) with ^ = 0 
and (17) 



[ [ k(ds,dy)\ [ I 

Jt-sJW / 

< 111“ [ [ 1a,w(5, y)Co(^, j)|g(f, X, yTHds, dy) 

Jt-s 

Hence, we have established that 






for |§| sufficiently large. 

To complete the proof, we can follow the same arguments as in [10, Lemma 3.3] 
which rely on the result in [21, Proposition 2.3]. Note that the exponent yo on the 
right-hand side of the gradient estimate accounts for the lower bound of the growth 
of the term in (18), which in the case of SDEs is equal to 1. 

The next lemma shows that the error in the approximation in (26) and 

the ambit field X(t,x) is bounded by a power of s. 

Lemma 5 Assume that Assumptions 1 hold for some a G (0, 2) and that cr, b are 
Lipschitz continuous functions. Then, for any f G (0, a), and £ G (0, ^ A 1), 

E[\X(t, x) - V(r, x)f] < 



where y: = min{}/i, y 2 , ys, y 4 j. 
Proof Clearly, 



E[|Z(f,x) - V(r,x)|^] 

< QE / / \At(x)is,y)g{t,s\x,y){a{s,y) - a{t - e,x))L{ds,dy) 

L Jt-sJW 

+ CpE [ [ \B,(x){s,y)h{t,s\x,y){b{s,y) -b{t - s,x))dyds 

L Jt-eJW 



Fix y G (o', 2] and apply Lemma 3 to the stochastic process H{s,y)\ = cr(5’, y) — 
a(t — s,x), where the arguments t,s,x are fixed. We obtain 
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/ / \A,(x){s,y)g{t,s\x,y){a{s,y) -a{t - s,x))L{ds,dy) 

Jt-e 

[ ( \A,(x){s,y)Cp{s,y)\A,(x)\g{t,s,x,yW 

\Jt-s JR‘> 

X E[|cr(^, j) — (r(t — e, dy^ 



E 



J/y 



Owing to hypothesis (H2) this last expression is bounded (up to the constant 

Ca,^,ye^/““^)by 

0 ^ / ^A,(x){s,y)Cp{s,y)\A,{x)\g{t,s,x,y)\^ 

t-s JW 

\Ny 

X (|t - e - + |x - y\^^^)X(ds, dy) I 



The inequality (19) implies 

p/3/a- 



iLL l/i,(jc)(s, y)Cfi{s, y)\g{t, s,x, - e - 

Ce^4+n)-i, 



PlY 



and (20) yields 



y)C^{s, y)\g{t,s,x, y)v\x - y 



\^/y 

dy) I 



Thus, 



E 



-(/: 

Ce^4+K2)-i. 

/ / lA,ix){s,y)g(t,s;x,y)(a(s,y)-a(t,x))L(ds,dy) 

Jt-e Jr^ 

< Ce'®4+[M'^K2])-i_ 



(30) 



Assume that /6 > 1 (and therefore a > 1). Holder’s inequality with respect to the 
finite measure h(t, s; x, y)dyds, (H2), (21), (22), imply 



/ / \B,(x){s,y)h{t,s\x,y){b{s,y) -b{t - s,x))dyds 

Jt-e JR‘‘ 

<Cfi f [ lB,(x)(s,y)\h(t,s,x,y)fE[\b(s,y)-b(t-s,x)\f^]dyds 

Jt-8 JR^ 
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<C^ [ [ lB,(x)(s,y)\h{t,s,x,y)f\t-s-sf^^dyds 

J t—e 

+ Cp f [ lB,(x)is,y)\h{t,s,x,y)\^\x - yf^l^dyds 

Jt-e JW 



Suppose now that p < 1, we use Jensen’s inequality and once more, (H2), (21), 
(22), to obtain 



/ / \B,(x){s,y)h{t,s-,x,y){b{s,y)-b{l-s,x))dyds 1 

Jt-s JW J 

< iB,(x)(Syy)\h(t,s;x,y)\\b(s,y) - b(t - e, x)|(/jd^ jy 

=iLL ^B,(x)(s, y)\h{t, s; X, y)|E[|/7(s, y) - b(t - s, x)\]dyds^ 
(/ ItR‘1 + \x -yf^]dyds^ 



£ C 



This finishes the proof. 

We are now in a position to prove Theorem 2. 

Proof {Proof of Theorem!) We consider the inequality 

m<y{t,x)\’'Al4>{X{t,x))]\ < |E[(|a(r,x)r - \a{t-£,x)\’')Al4>iXit,x))\\ 

+ \¥\\a{t-s,x)\''{Al<p{X{t,x))-Al4>{X^{t,x)))]\ 
+ \^[\a(t-s,x)\’'Alcl>(X^{t,x))\\. (31) 



Fix G (0, Ck: A 1). As in (8) we have 



|E[(|CT(r,x)|" - |CT(r - e,x)r)z\^0(Z(r,x))]| 

< Cnmv^^\h\^^\\<y{t,x)\’^ - |a(r -e,x)r|]| 

Now we proceed as in (9) using the finiteness of the moments oi a {t,x) stated in 
Hypothesis (HI), and (H2). Then for all y G (ck:, 2] we have 



E[||tj(A x)\^ — \a{t — £, v)|"|] 



= E 



n — \ 

a(t,x) — a {t — v)| |cr(A v)K \cf{t — £, x)\^~^~-^ 

7=0 
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< C ^E[|ct(A x) — (7 {t — e, x)|^]^ 

X ^E|^^^|or(Ax)KV(r -e,x))| 



K/(K-1)1\1-1/K 



^j=0 



]) 



<C„s^'. 



Therefore 



|E[(|a(f,x)r - \a{t-s,x)r)Al<p{X(t,x))]\ < C„ |/i| . (32) 

Consider the inequality \\A^(j)\\(^a < Cn||0||^«, and apply Holder’s inequality 
with some ^ G (j), a) io obtain 



|E[|a(r - e, x)\’'{Al4>{X(t, x)) - Al4>(X\t, x)))]| 

< Cn\m<^n^\cy{t - s, x)r|Z(r, X) - XHt, x)|"] 

< C„||0||^^,(E[|Z(?,x) -X^(r,x)|q)"/^(E[|a(M(? 

(33) 

where y\ = mm{y\, K2, K3, K4}, and we have applied Lemma 5. 

Conditionally to ^t-e^ the random variable 




y)git, s\ X, y)L{ds, dy) 



has an infinitely divisible law and a -density pt^xs for which a gradient estimate 
holds (see Lemma 4). Then, by a discrete integration by parts, and owing to (H3), 



\E[\ait-s,x)\^Al4>{X\t,x))]\ 
= E 



f \a{t - s, x)\''Al4>{Ut + <y{t - s, x)y)pt,:c,e(y)dy 
m 

f |cr(f - s, x)\"(j)(Uj + cr(f - e, x)y)A’'_^^^_^^^^_i^pt,x,siy)dy 
_ JM. 

< ||(^||ooE \a{t-s,x)f f \A''_^^^_^^^^_ij^p,^x,s{y)\dy . 



From Lemma 4 it follows that 



[ \^%^,_,,x)-^HP>xAy)\dy < c„\ait-s,x)A\hn\pl"l,h,^ 

i/ M 



< Cn\a{t - s,x)\-"\h\"s-"y°^“, 
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which yields 

\E[\cr{t-s,x)rAlcj>{X^{t,x))]\ < CnU\Wn\hrs-’^y<^/\ (34) 

because ||0||oo 5 mWn- 

The estimates (31)-(34) imply 

|E[|a(?,x)|M"0(Z(r,x))]| < 

Set s = §|/z|^, with \h\ <\ and 

a(n — rf) 

P + a^y — a ’ ny^ 

Notice that, since lim^^oo for ^0 close to a and yo as in the hypothesis, 

this interval is nonempty. Then, easy computations show that with the choices of s 
and p, one has 

with ^ > Tfj. Hence, with Lemma 1 we finish the proof of the theorem. 

Remark 3 (i) If a is bounded away from zero, then one does not need to assume 
the existence of moments of sufficiently high order. In this case one can follow 
the strategy in [20]. 

(ii) The methodology used in this section is not restricted to pure-jump stable-like 
noises. One can also adapt it to the case of Gaussian space-time white noises. 
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Part II 
Applications 




Dynamic Risk Measures and Path-Dependent 
Second Order PDEs 



Jocelyne Bion-Nadal 



Abstract We propose new notions of regular solutions and viscosity solutions for 
path-dependent second order partial differential equations. Making use of the martin- 
gale problem approach to path-dependent diffusion processes, we explicitly construct 
families of time-consistent dynamic risk measures on the set of cadlag paths IR^ val- 
ued endowed with the Skorokhod topology. These risk measures are shown to have 
regularity properties. We prove then that these time-consistent dynamic risk mea- 
sures provide viscosity supersolutions and viscosity subsolutions for path-dependent 
semi-linear second order partial differential equations. 

Keywords Path-dependent PDE • Risk measures • Martingale problems 

MSC: 35D40 • 35R15 • 35K55 • 60J60 • 91B30 



1 Introduction 

Diffusion processes are linked with parabolic second order Partial Differential 
Equations via the “Eeynman-Kac” formula. The field of path-dependent PDEs first 
started in 2010 when Peng asked in [19] whether a BSDE (Backward Stochastic 
Differential Equation first introduced in [17]) could be considered as a solution to a 
path-dependent PDE. In line with the recent literature on the topic, a solution to a 
path-dependent second order PDE 

H(u, CO, <p{u, co), du4>(u, co), Dx(t){u, co), D^cj){u, co)) = 0 (1) 

is searched as a progressive function cj){u,(o) (i.e. a path dependent function depend- 
ing at time u on all the path up to time u). 
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In contrast with the classical setting, the notion of regular solution for a path- 
dependent PDE (1) needs to deal with cMlag paths. Indeed to give a meaning to 
the partial derivatives D^(t){u, co) and D^(j){u, cd) at {uq, coq), one needs to assume 
that (p(uo, co) is defined for paths co admitting a jump at time uq. Peng has intro- 
duced in [20] a notions of regular and viscosity solution for path-dependent second 
order PDEs. In [20] a regular or a viscosity solution for a path-dependent PDE is a 
progressive function (j){t,co) defined on the space of cMlag paths endowed with the 
uniform norm topology and the notion of continuity and partial derivatives are those 
introduced by Dupire [12]. A comparison theorem is proved in this setting [20]. 
The motivation comes mainly from the theory of BSDE and examples of regular 
solutions to path-dependent PDEs can be constructed from BSDEs [18]. The main 
drawback for this approach based on [12] is that the uniform norm topology on the 
set of cMlag paths is not separable, hence it is not a Polish space. Recently Ekren et 
al. proposed a notion of viscosity solution for path-dependent PDEs in the setting of 
continuous paths in [13, 14]. This work was motivated by the fact that a continuous 
function defined on the set of continuous paths does not have a unique extension 
into a continuous function on the set of cMlag paths. Therefore it is suitable that 
the notion of viscosity solution for functions defined only on the set of continuous 
paths does not require to extend the function to the set of cMlag paths. The approach 
developed in [13, 14] is also based on BSDE. 

In the present paper we introduce a new notion of regular and viscosity solution 
for path-dependent second order PDEs (Sect. 2). A solution to (1) is a progressive 
function 0 defined on 7R+ x Q where Q is the set of cMlag paths. In contrast with [20] 
and many works on path-dependent problems, we consider the Skorokhod topology 
on the set of cMlag paths. Thus ^2 is a Polish space. This property is very important. 
To define the continuity and regularity properties for a progressive function, we make 
use of the one to one correspondence between progressive functions on 7R+ x Q 
and strictly progressive functions on x x IR^ established in [3]. A function 
0 defined on 7R+ x ^2 is progressive if 0(5', a;) = (p(s, co') as soon as co(u) = co'(u) 
for all 0 < M < 5". A function 0 defined on 7R+ x x IR^ is strictly progressive 
if cp(s, CO, x) = cp(s, co', x) as soon as co(u) = co'(u) for all 0 < w < 5-. The one to 
one correspondence is given hy cl)(s, co, x) = cj)(s, co ^) where (co x)(u) = co(u) 
for all 0 < u < s and (co x)(u) = x for all w > 5-. The continuity and regularity 
properties that we want for a progressive function 0 are derived from the usual 
continuity and regularity properties for 0 via the above one to one correspondence. 
Eor example, DxCp(u, co) is defined as DxCp(u, co) := DxCp(u, co, co(u)) where Dxcj) is 
the usual partial derivative of 0 with respect to the third variable. Notice that via the 
above one to one correspondence, the regularity properties for a progressive function 
0 are defined in a very natural way. This is in contrast with the most commonly used 
regularity definitions first introduced in [12]. 

The notion of viscosity solution that we introduce in the present paper is motivated 
by our construction of a solution to semi-linear second order path-dependent PDEs 
based on the martingale problem approach. 

Our study for viscosity solutions of path-dependent PDEs allows then to introduce a 
new definition of viscosity solution for path-dependent functions defined only on the 
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set of continuous paths. As in [13, 14], this does not require to extend the function 
nor the coefficient functions appearing in the path-dependent PDE to the set of cadlag 
paths. However our approach is very different from the one introduced in [13, 14]. 
In the present paper we construct then time consistent dynamic risk measures on the 
set of cadlag paths, to produce solutions for path-dependent semi-linear second 
order PDEs. 

duv(u, co) -h ^^v(u, co) CO, Dxv(u, co)) = 0 on [0, t] x Q 

v(t, 0 )) = h{ao) 



with co) = ^Tr[a(u, co)D^v{u, oo)]. 

These dynamic risk measures are constructed using probability measures solution 
to a path-dependent martingale problem. This approach is motivated by the Eeyn- 
man Kac formula and more specifically by the link between solutions of a parabolic 
second order PDE and probability measures solutions to a martingale problem. The 
martingale problem has been first introduced and studied by Stroock and Varadhan 
[10, 11] in the case of continuous diffusion processes. The martingale problem is 
linked to stochastic differential equations. However the martingale problem formu- 
lation is intrinsic and is very well suitable to construct risk measures. In [22] the 
martingale problem has been extended and studied to the case of jump diffusions. In 
[3] , the study of the martingale problem is extended to the path-dependent case which 
means that the functions a and h (and also the jump measure) are no more defined 
on IRj^ X IR^ but on 7R+ x Q . The question of existence and uniqueness of a solution 
to a path-dependent martingale problem is addressed in [3] in a general setting of 
diffusions with a path-dependent jump term. In the case where there is no jump term 
and under Lipschitz conditions on the coefficients, the existence and uniqueness of a 
solution has been already established in [8] from the stochastic differential equation 
point of view. 

In Sect. 3, we recall some results from [3] on the martingale problem for path- 
dependent diffusion processes and study the support of a probability measure solution 
to the path-dependent martingale problem for 

The theory of dynamic risk measures on a filtered probability space has been 
developped in recent years. In the case of a Brownian filtration, dynamic risk mea- 
sures coincide with g-expectations introduced by Peng [21]. An important property 
for dynamic risk measures is time consistency. The time consistency property for 
dynamic risk measures is the analogue of the Dynamic Programming Principle. Eor 
sublinear dynamic risk measures time consistency has been characterized by Del- 
baen [9]. Eor general convex dynamic risk measures two different characterizations 
of time consistency have been given. One by Cheridito et al. [7], the other by Bion- 
Nadal [5]. This last characterization of time consistency is very useful in order to 
construct time consistent dynamic risk measures. 

Eollowing [5], one can construct a time consistent dynamic risk measure as soon 
as one has a stable set of equivalent probability measures ^ and a penalty defined 
on ^ satisfying some conditions. In Sect. 4, we construct a stable set of probability 
measures on the set Q of cMlag paths. In the whole paper <2 is a given bounded 
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progressively continuous function defined on x such that a(s,co) is invertible 
for all (s, co). For all r > 0 and co e the set of probability measures ^r,o) is a stable 

set generated by probability measures solution to the martingale problem for 
^a,b starting from co at time r. The functions b are assumed to satisfy some uniform 
BMO condition. In Sect. 5 we construct penalties on the stable set ^r,co from a path- 
dependent function g. Some growth conditions are assumed on the function g to 
ensure integrability properties for the penalties. With such a stable set and penalties, 
we construct in Sect. 6 time consistent convex dynamic risk measures. More precisely 
for all r and co we construct a time consistent convex dynamic risk measure on 
the filtered probability space (^t), Qr,co) where 2^^ means 2^’^ and (^t) is 
the canonical filtration. 

We prove furthermore in Sect. 7 that these time consistent dynamic risk measures 
satisfy the following Feller property: Let % be the set of measurable functions 
h defined on which can be written as h(co) = k(co,co(t)) for some continuous 
function k on x IR^ such that k(co,x) = k(co\ x) if co(u) = co\u) for all w < t. 
Then for all h in there is a progressively lower semicontinuous function R(h) on 
[0, t] X Q such thnt R(h)(t, co) = h(co), 

plf\h) = R{h){r, o;o) VO < r < r 
Furthermore, for all 0 < r < 5’ < t, 

Pl’^(h)(co') = R{h){s, co') a.s. 

We prove furthermore in Sect. 8 that the lower semicontinuous function R(h) is 
a viscosity supersolution for the path-dependent semi linear second order partial 
differential equation (2). The function/ : 7R+ xIR^ x IR^ IR appearing in Eq. (2) 
is linked to the choice of the penalty of the risk measure. It is convex in the last 
variable. 

We prove also that the upper semi-continuous envelope of R(h) is a viscosity subso- 
lution for (2). 

When the above function h is defined only on the set of continuous paths, it is the same 
for the function RQi). We prove then thnt RQi) provides a viscosity supersolution and 
a viscosity subsolution for (2) on the set of continuous paths. 



2 Solution of Path-dependent PDEs 

In this section we introduce new notions for regular and viscosity solutions for second 
order path-dependent PDEs on the set of cMlag paths. In contrast with [20] and all 
the papers using the notions of continuity and derivative introduced by Dupire [12], 
we work with the Skorohod topology on the set of cMlag paths. A solution to a 
path-dependent PDF (1) is a progressive function ct){t,co) where t belongs to 7R+ and 
CO belongs to the set of cMlag paths. 
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2.1 Topology and Regularity Properties 

In the whole paper Q denotes the set of cadlag paths with the Skorohod topology. 
The set ^ is then a Polish space (i.e. is metrizable and separable). Polish spaces have 
nice properties which are very important in the construction of solutions for path- 
dependent PDEs. Among them are the existence of regular conditional probability 
distributions, the equivalence between relative compactness and tightness for a set 
of probability measures, to name a few. 

To define the continuity and regularity properties for progressive functions, we use 
the one to one correspondence between progressive functions on 7R+ x Q and strictly 
progressive functions on 7R+ x x IR^ that we have established in [3]. 

A function 0 defined on 7R+ x ^2 is progressive if (p(s, co) = (p(s, co') as soon as 
co(u) = co'(u) for ailO < u < s. 

A function 0 defined on 7R+ x Q x IR^ is strictly progressive if 0(5’, a;, x) = 
0(5', co\ x) as soon as co(u) = co'(u) for all 0 < w < 5-. 

The one to one correspondence 0 ^ 0 is given by 0(5", a;, x) = 0(5", a; x) where 



Notice that co) = cj){s,(o, co(s)). Accordingly a progressive function 0 (in 2 
variables (s,co)) is said to be progressively continuous if the associated function 0 
(in 3 variables ( 5 -, a;, x)) is continuous on 7R+ x Q x IR^. 



2.2 Regular Solution 

Making use of the one to one corrrespondence between progressive functions on 
IRj^ X Q and strictly progressive functions on 7R+ x Q x IR^, we can then give the 
following definition for a solution to a general path-dependent PDE. 

Definition 1 Let v be a progressive function on x Q where Q is the set of 
cMlag paths with the Skorokhod topology, v is a regular solution to the following 
path-dependent second order PDE 



CO *5 x{u) = (o{u) V 0 < w < 5 - and co x{u) = xV u > s. 



( 3 ) 




if the function v belongs to x Q x IR^) and if the usual partial derivatives 

of V satisfy the equation 



H{u, CO CO, x), duv(u, CO, x), Dxv{u, co, x), D^v(u, co,x) = 0 (5) 



with v(u, CO, x) = v(u, CO 
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(co Ws < u, and (co ■^)(‘^) = xWs > u. The partial derivatives of v 

are the usual ones, the continuity notion for v is the usual one. 



2.3 Viscosity Solutions on the Set of Cadlag Paths 

The following definitions are motivated by the construction of viscosity solutions for 
path-dependent PDEs that we develop in the following sections. Our construction of 
solutions is based on the martingale problem approach for path-dependent diffusions. 
The support of every probability measure solution to the martingale problem 
for starting from coq at time r is contained in the set of paths which coincide with 
coQ up to time r. This is a motivation for the following weak notion of continuity and 
also for the weak notion of local minimizer (or local maximizer) that we introduce 
in the definition of viscosity solution. 

Definition 2 A progressively measurable function v defined on 7R+ x ^2 is contin- 
uous in viscosity sense at (r, (z>o) if 

v(r, coo) = lim{v(5’, co), (s, co) e Dg{r, o^o)} (6) 



where 



Dg(r, coq) = {(s, co), r < s < r co(u) = coo(u), VO < w < r 

co(u) = co(s) 'iu > 5’, and sup \\cjo{u) — coo(r)\\ < s] (7) 

r<u<s 



V is lower (resp. upper) semi continuous in viscosity sense if Eq. (6) is satisfied 
replacing lim by lim inf (resp. lim sup). 

Definition 3 Let v be a progressively measurable function on (7R+ x (^r)) 
where is the set of cMlag paths with the Skorokhod topology and (^t) the canon- 
ical filtration. 

1. V is a viscosity supersolution of (4) if v is lower semi-continuous in viscosity 
sense, and if for all (^o, coo) ^ x t2, 

• V is bounded from below on Ds(to, coq) for some e > 0. 

• for all strictly progressive function 0 g ’ (7R+ x x IR^) such that 
v(^o, coo) = cp(to, coo), and (^o, coo) is a minimizer of v — 0 on Dg(to, coo) for 
some £ > 0, 

H(u, CO X, cj){u, CO, x), duCp(u, CO, x), Dxcj){u, CO, x), D^cj){u, co,x) >0 (8) 



at point (^ 0 , coo, coo(to)). 
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2. V is a viscosity subsolution of (4) if v is upper semi-continuous in viscosity sense, 
and for all (^o, coo), 

• V is bounded from above on Dg(to, coq) for some e > 0 

• for all strictly progressive function 0 g ’ (7R+ x ^2 x IR^) such that 
v(to, coo) = cj) (to, coo), and (^o, coo) is a maximizer of v — 0 on Dg(to, coo) for 
some £ > 0, 

H(u, CO X, <p(u, CO, x), du<p(u, CO, x), Dxcj)(u, co, x), D^cj)(u, co,x) <0 (9) 



at point (to,coo,coo(to)). 

3. V is a viscosity solution if v is both a viscosity supersolution and a viscosity 
subsolution. 



2.4 Viscosity Solution on the Set of Continuous Paths 

Recently Ekren et al. [13, 14] introduced a notion of viscosity solution of a path- 
dependent second order PDE for a function v defined on the set of continuous paths. 
One motivation for this was to define the notion of viscosity solution without extend- 
ing the function v to the set of cadlag paths. 

We can notice that within our setting we can also define a notion of viscosity solution 
for a function v defined only on the set of continuous paths without extending v. We 
give the following definition which is very different from that of [13, 14] and much 
simpler. 

Definition 4 Let v be a progressively measurable function defined on 7R+ x 
^(IR^, IR^) with the usual uniform norm topology, v is a viscosity supersolu- 
tion of (4) if V is lower semi-continuous in viscosity sense and for all (^o, coo) G 

7R+ X ^(1R^,1R^) 

• V is bounded from below on Dg(to, coo) for some £ > 0, 

• for all function strictly progressive 0 G ’ ’ (7R+ x ^2 x IR^) such that v(^o , coo) = 
cp(to, coo), and (^o, coo) is a minimizer of v — 0 on Dg(to, coo) for some £ > 0, 

H(u, CO cj)(u, CO, x), du(p(u, CO, x), Dxcj)(u, co, x), D^cj)(u, co,x) >0 (10) 

at point (^ 0 , coo, coo(to)). 

Here is the set of cadlag paths with the Skorokhod topology, Dg is the intersection 

of Dg with the set of continuous paths, and cl)(u, co) = (j)(u, co, co(u)). 

The lower semi-continuity property in viscosity sense at (^o, coo) in Definition 4 
v(to, coo) = lim{v(^, co), (s, co) G £>^(^0, ^^o)} 



means 
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We have a similar definition for a viscosity subsolution. Notice that the continuity 
along the sets Dg is also considered in [13, 14]. However the notion of viscosity 
solution introduced in [13, 14] is fundamentally different from ours. 

We will now construct time consistent dynamic risk measures making use of proba- 
bility measures solution to a path-dependent martingale problem. We will then prove 
that this leads to viscosity solutions to path-dependent PDEs (2). 



3 Path-dependent Martingale Problem 



In the classical setting, the Feynman Kac formula establishes a link between a solu- 
tion of a parabolic second order PDE and probability measures solutions to a mar- 
tingale problem. Assume that v is a solution of the PDE duv{t, x) -h x) = 0, 

v(T, .) = h with 

x) = -Tr{a{t, x))D^(v)(t, x) -h b{t, x)*DxV(t, x) 

From the Feynman Kac formula, the value v(t,x) can be expressed from the prob- 
ability measure solution to the martingale problem associated to the operator 
^a,b starting from v at time t. v(t, x) = EQa,b(h(XT)), where (Xu) is the canonical 
process. 

One natural way to construct soliutions for path-dependent parabolic second order 
partial differential equations is thus to start with probability measures solution to 
the path-dependent martingale problem associated to the operator for path- 
dependent coefficients a and b. Let Q be the set of cadlag paths and (^t) be the 
canonical filtration. Let a and b be progressively measurable functions on 7R+ x ^2 
(a takes values in non negative invertible matrices and b in IR^). Let be the 
operator defined on ^^(IR^) by 



co) 



1 

2 



n 

ca) 

1 



02 

dxfdxj 



n 

+ 'Y_bi{t,co) 
1 



a 

dxi 



( 11 ) 



Definition 5 Let r > 0, cdq ^ *^2. A probability measure Q defined on (Q , (^t)) 
is a solution to the path-dependent martingale problem for starting from coq at 
time r if 

Q({co G \co(u) = cjoo(u)y0 < u < r}) = I 
and if for all/ G ^^'^(IR-^ x IR^), and all t, (Z^’J^)r<t given by 

=f{t,Xr{co)) -f{r,Xr{co)) - [\-^ + co))(f){u, Xu{co))du (12) 

Jr ^ ^ 



is a (Q, (^t)) martingale. 
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In [3] we have studied the more general martingale problem associated with path- 
dependent diffusions with jumps. We have shown that the good setting to prove 
that the martingale problem is well posed is to deal with diffusions operators whose 
coefficients a and h are progressively continuous. 

Recall the following result from [3]. 

Theorem 1 7. Let a be a progressively continuous bounded function defined on 

7R+ X L2 with values in the set of non negative matrices. Assume that a(s,co) is 
invertible for all {s, (d). Let b be a progressively measurable bounded function 
defined on 7R+ x L2 with values in IRA. For all (r, coq), the martingale problem 
for starting from coq at time r is well posed i.e. admits a unique solution 

Gr’So of cddldg paths. 

2. Assume furthermore that b is progressively continuous bounded. Consider the 
set of probability measures ^\{L2) equipped with the weak topology. Then the 
map 

(r, ft), x) e ZR+ X X iR" ^ e ^i{Q) 

is continuous on [{r,co,x) \ co = co A- 



3.1 The Role of Continuous Paths 

In all the following means 

We start with a result which proves that the probability measure is supported 
by paths which are continuous after time r. 

Proposition 1 Every probability measure solution to the martingale problem 
for starting from coq at time r is supported by paths which are continuous after 
time r, i.e. continuous on [r, oo[. 

More precisely 

^ Vw < r, and cjo\[r,oo[ ^ oo[, IR^)} = 1 

Proof The probability measure 2^’^^ is equivalent with 2r,6t;o’ assume 

that b = 0. 

The function a is progressively continuous. This means that the function a is contin- 
uous. Let an be the ^ delayed function defined as dn(u, co,x) = d(u — ^,cl>,x) for 
all w > r -h i and dn(u, co, x) = d{r, a;, v) for all 0 < u < r F The function an 
is also progressively continuous. Given n, let t^ be an increasing sequence such that 
r" = r and ^ 

On a Polish space for every subsigma algebra of the Borel sigma algebra there 
exists a regular conditional probability distribution. It follows from [22] and the 
uniqueness of the solution for starting from co at time t^, that for all t^, 

for Qr%o almost all co. Let an,o)(u,x) = dn(u, co,x) 



156 



J. Bion-Nadal 



Let Ak — {co\ co'^^n ^ Given oo the function an,co is not path- 

dependent. It follows then from [11] that is supported by paths continuous on 
00 [. We remark that an{u, o)' ,x) = an^coiu, x) for all ^ < %"+2 and all cd' 

such that cd\u) = cd{u) for all u < It follows that Q^n ^{A^) = 1 for all co. We 

deduce by induction that Qr%o({cjo\ oo[ ^ = i- 

The an being uniformly bounded, for given r and o;o, the set of probabilty mea- 
sures {Qr%o,n G N*} is weakly relatively compact. There is a subsequence weakly 
converging to a probability measure Q. From the continuity assumption on a, it fol- 
lows that Q solves the martingale problem for starting from coq at time r. The 
uniqueness of the solution to this martingale problem implies that Q = Gr,ft;o’ 
set {cjo\ ^\[r oo[ ^ c>o[, is a closed subset of f2. It follows from 

the Portmanteau Theorem, see e.g. [2] Theorem 2.1, that G 

^([r, oo[,7R^)} = 1 

Corollary 1 For all continuous path o;o and all r, the support of the probability 
measure is contained in the set of continuous paths: 

= 1 

Remark 1 In the simpler case where the function a is only defined on the set of con- 
tinuous paths, the continuity hypothesis is just the usual continuity hypothesis for a 
function defined on 7R+ x IR^)) for the uniform norm topology. The asso- 

ciated martingale problem: probability measure solution to the martingale problem 
for starting from o;o at time r can only be stated for initial continuous paths coq 
(otherwise the path-dependent function a(u,co) should be defined for paths co which 
can have jumps before time u). 



4 Stable Set of Probability Measures Solution to a 
Path-dependent Martingale Problem 

In all the paper ^2 denotes the set of cMlag paths endowed with the Skorokhod topol- 
ogy. From now on, a{s,co)\^di given progressively continuous function on 7R+ x F2 
with values in non negative matrices. We assume that a is bounded and that a{s,co) 
is invertible for all {s,co). The explicit construction of dynamic risk measures devel- 
oped here, making use of probability measures solutions to a martingale problem 
was first initiated in the unpublished preprint [4] in the Markovian case. We have 
introduced in [5] a general method to construct time consistent convex dynamic risk 
measures. This construction makes use of two tools. The first one is a set ^ of equiv- 
alent probability measures stable by composition and stable by bifurcation (cf. [5] 
Definition 4.1). The second one consists in penalties cis^tiQ)^ s <t defined for every 
probability measure Q in <^, satisfying the local condition and the cocycle condition. 
The corresponding definitions are recalled in the Appendix. 
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4.1 Multivalued Mapping and Continuous Selector 



Definition 6 X denotes the quotient of 7R+ x x IR^ by the equivalence relation 
(t, co,x) ~ (t', co\x') if t = X = x' and co(u) = co\u)yu < t. The metric 
topology on X is induced by the one to one map from X into a subset of 7R+ xf2 x IR ^ : 

{t, (jO, x) (t, CO 

where a; x has been defined in Sect. 2.2 Eq. (3). 

The following observation is straightforward. 

Remark 2 The set X is equipped with the metric topology defined above. Then every 
progressively continuous map on 7R+ x Q defines a unique continuous map on X. 
Furthermore every map continuous on the subset {(r,co,x), co = co of 7R+ x 

X IR^ defines also a unique continuous map on X. 

Recall now the definition of a multivalued mapping from X to Y. We use here the 
terminology chosen in [9]. Notice that the terminology used in [1] for multivalued 
mapping is correspondence. 

Definition 7 A multivalued mapping A from X into IR^ is a map A defined on X 
such that for all (t, co,x) e X, A(t, co, x) is a. subset of IR^. It can have additional 
properties: 

1. A is convex if W(t, co, x) e X, A(t, co, x) is a convex subset of IR^. 

2. A is closed if for all (t, co, x), A(t, co, x) is closed. 

Recall the following definition of a continuous selector (Definition 16.57 of [1]). 

Definition 8 A selector from a multivalued mapping A from X into IR^ is a function 
s : X ^ IR^ such that s(t, co, x) e A(t, co, x) for all (t, co, x) e X. A continuous 
selector is a selector which is continuous. 

Recall the following definition from [1] (Definition 16.2 and Lemma 16.5): 

Definition 9 A multivalued mapping A from X into IR^ is lower hemicontinuous if 
it satisfies the following equivalent conditions 

• For every closed subset F of IR^, A^{F) = {(t, co,x) e X : A(t, co, x) C F} is 
closed 

• For every open subset V of IR^, A^(V) = {{t, co,x) e X : A{t, co,x) FV / 0} is 
open 

Recall the following Michael Selection Theorem (cf. 11] Theorem 16.61) 

Theorem 2 A lower hemicontinuous mapping from a paracompact space into a 
Banach space with non empty closed convex values admits a continuous selector. 

Recall also that every metrizable space is paracompact (Theorem 2.86 of 11]). 
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4.2 Stable Set of Probability Measures Associated to a 
Multivalued Mapping 

In all the following, A is a. closed convex lower hemicontinuous multivalued mapping 
from X into IR^. In the following (X^) denotes the canonical process on Q : For all 
cadlag path co, Xt(co) = co(t). is the canonical filtration. 

Given the progressively continuous matrix valued map a, given r > 0, and co e 
we want to associate to a stable set ^r,co{^) of probability measures on 
all equivalent with the probability measure ^ on Furthermore we want to 
construct a continuous function v on X. Therefore we start with continuous selectors 
X from A. 

Definition 10 Let a be progressively continuous bounded defined on 7R+ x ^2 with 
values in non negative matrices, such that a{t,co) is invertible for all (t,co). Let A be 
a closed convex lower hemicontinuous multivalued mapping from X into IR^. 

• We define L(A)to be the set of continuous bounded selectors from the multivalued 
mapping A. 

• For given r > 0 and co e ^2, the set ^r,co(^) is the stable set of probability 

measures generated by the probability measures X e L(A) with X(t, co') = 

X{t,co',Xt(co')) 

dO^.a-k .T I rT 

dQr,(jo Jr ^ Jr 

We give now a description of the set ^r,co{X^)- 

Definition 11 We define L{A)io be the set of processes JI such that there is a finite 
subdivision 0 = sq < •• < Sf < • • < Sk < oo. There is a continuous selector 

A.o,/o in L(A). And for all 0 < / < ^ there is a finite partition (Aij)j^j^ of into 
measurable sets, and continuous selectors Xij in L(A) such that 

VSi < U < 'ico' G Q , Jl(u, co' , x) = z Xij{u, co',x)lAij(a>') 

j^Ii 

Wsk < u Wco' G Q, JI(u, co' , x) — z Xkj(u,co',x)lAkj(co') 

j^h 

'iu < Sq, Wco' e Q, JI(u, co' , x) = Xqjq(u, co' , x) ( 13 ) 

Remark 3 Every process JI in L(A) is bounded strictly progressive and ^ x ^{IR^) 
measurable where ^ is the predictable sigma algebra. However there is no uniform 
bound. 
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Proposition 2 1. Let a be as above and Ji G L{A). For allr >Q and all co e Q, 

there is a unique solution to the martingale problem for starting from 

(jo at time r with p.(u, co) = JI(u, co, cjo(u)). Furthermore for all r < s, the 
map co' measurable and is a regular conditional probability 

distribution of given 3§s- 

2. Given 0 < r, the set ^r,co{2\) is the set of all probability measures for 
some process JL belonging to L{A). 

Proof Let ]I G L{A). There is a finite subdivision 0 = < • • • < ^/ < < 

' ' ' Sk < 00 such that ~jl is described by Eq. (13). Let r and co. We prove first by 
induction on k that there is a unique solution Q to the martingale problem for 
starting from co at time r and that Q belongs to ^r,cD{2\). 

For ^ = 0 the result is true by hypothesis. 

Inductive step: Assume that ^ > 1 and that the result is proved for ^ — 1. Let 2 be a 
solution to the martingale problem for starting from co at time r. Let Qsj,,co' be a 

regular conditional probability distribution of Q given ^Sk • From [22] it follows that 
for Q almost all co' in Aj^j, Qs^,co' is a solution to the martingale problem for 
starting from co' at time Sk- The martingale problem for posed. Let 

be the unique solution to the martingale problem for starting from 

co' at time Sk. It follows that Qsk,a)' = Q a.s. Thus for all 

j^h 

Eq{^\^sO = X WjE i^\^sO (14) 

jelk 

On the other hand the restriction of Q to ^Sk is a solution to the martingale problem for 
^a,av y ^ L(A) is associated to the subdivision (^/)o</<y^-i and v coincides 

with pi on • From the induction hypothesis it follows that the restriction of Q to 
is uniquely determined, it coincides with and it belongs to ^r,cD{2\). 

The end of the proof of the inductive step follows then from Eq. (14), from the ^Sk 

measurability of the map co' and from the definition of 

On the other hand it is easy to verify that the set {Qr%^ : JI ^ T(A)} is stable. □ 



5 Construction of Penalties 

In the preceding section we have constructed for all given {r,co) a stable set of 
probability measures ^r,co{2l) associated to a multivalued mapping A. In this section 
we construct penalties otstiQ) for all r < 5' < r and all Q G <^^,^(A) making use of 
a function g. 
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Let g : IR^ xf2x IR^ lRU{-\-oo} be a progressively measurable function. Let A be 
a closed convex multivalued Borel mapping such that for all (t,co,x) e IR^ xf2xlR^, 
{0} c A(t, CO, v) c {y e 1R^\ g(t, co v, y) < oo}. Define/ as follows: 

Wz € = sup (-z.y - git,a),y)) (15) 

yeA{t,a),Xt{a))) 

The following lemma is straightforward: 

Lemma 1 For all (t, co), f(t, co, .) is a closed convex function which is the dual 
transform of the function g(t,co, .) where 

g(t, CO, y) = g(t, CO, y) if y e A(t, co, Xt(co)) 

= +00 else (16) 

For every (t, co) dom(g(t, co, .)) = A(t, co, Xt(co)) 

If g(t, co,0) = 0 V(L co), f takes values in [0, oo]. 

Ifg takes values in [0, oo] and satisfies V(L co), infy^A{t,cD,Xt{co))g{h y) = 0 then 
for all (t, co), f(t, CO, 0) = 0. 

Notice that, since is a closed convex multivalued mapping, replacing g by g, one 
can always assume that for all (t, co), dom{g{t, co, .) = {y e IR^ \g(t, co, y) < oo} is 
closed, convex and equal to A (t,co, Xt(co)). We assume this in all the remainder. 

Definition 12 1. g satisfies the following polynomial growth condition (GCl) if 

there is ^ > 0, m G fV* and £ > 0 such that 

Vy € A(u,co,Xu(A), |g(M,u>,)')| </:(l+sup||X,(cw)||)'”(l + ||>-||2-^) (17) 

s<u 

2. g satisfies the growth condition (GC2) if there is ^ > 0 such that 

Vjeyl(M,cu,Z„(cu)), |g(M,«,j)| </:(l + ||y||2) (18) 

Recall the following definition of BMO processes. 

Definition 13 Let C > 0. Let R be a probability measure. A progressively measur- 
able process /x belongs to BMO(P) and has a BMO norm less or equal to C if for all 
stopping times r, 

/ oo 

\\l^s\fds\^r) < C 

Recall also from [15] that the stochastic exponential ^(/x) of a BMO process /x 
is uniformly integrable, and that the BMO norms with respect to P and P{(F{ii).) 
are equivalent. Also from [15], for all C > 0 there is 1 < /?o < co such that for all 
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BMO(P) process /x with \ \iJi\\BMO{P) < C, the stochastic exponential ^(/x) satisfies 
the reverse Holder inequality: 

< Kc^(iJi)s (19) 



Definition 14 Assume that g is non negative or satisfies one of the growth conditions 
(GCl) or (GC2). Let 0 < r < T. For all BMO{Q^ process fi A-valued, for all 
r < s < t < T, define the penalty ois,t(Qr%^) as follows 



t 

«5,r(Gr,T) = ^ j a>))du\^s) (20) 

We need to verify that the penalties are well defined for all BMO processes and that 
they satisfy the local property and the cocycle condition. (Definition introduced in 
[5], Definition 4.3 and recalled in the Appendix). 

Proposition 3 Assume that the process /x belongs to BMO(Q^ ^). Let C such that 
WP'llBMOiQf J < C. 

• 1. Assume that g satisfies the growth condition (GCl). Then Eq.(20) defines a 

random variable in Lp(Q^ co) all \ < p < oo, and for given p, the Lp{Q^ 
norms ofas^t(.Qr%^) uniformly bounded for r < s < t < T, for all pi such 

that ||/x||5M0(e^^^) - 

ois,t(QrlT) belongs also to Li{QpT) cmd the L\{QpT) norms ofas,t{Qr]T) 
are uniformly bounded for \ \pi\\BMO{Q^ S C and r < s < t < T. 

2. Assume that g satisfies ( GC2), then the random variables ots^tiQpol^) belong to 

^oo(Gr &») uniformly bounded for \ \pi\\BMO{Qy ^ C and r < s < t < 

T 

3. In case g is non negative, Eq.(20) defines a non negative random variable. 

• Assume that g satisfies the growth condition ( GCl ) or ( GC2 ). Then the penalty 
defined in (20) satisfies the cocycle condition for every Qr%^ : Let r < s < t < u 



O^sAQrlT) = «.,r(e":r) 



( 21 ) 



• The penalty defined in (20) is local on ^^^<^(A) 

• If g(t, (jo,0) = 0 V(L (X>) G IR^ X T2, The probability measure has zero 
penalty. 

Proof 1 . Assume that the function g satisfies the growth condition (GCl). Without 
loss of generality one can assume that m > 2. Choose pi > 1 such that (2 — 
s)pi =2. Let q be the conjugate exponent of pi . 
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It follows from the conditional Holder inequality and the equivalence of the 
BMO norms with respect to 2^ ^ and that 

t 

/ (1 +sup||X,(«)iniAt„(a))||2-^jM|i^,) 

’ J s<u 

s 

< Ki cn [EQa,aj . ( sup (1 + I |Xy I Ht-S)^ (22) 

s<s'<u 

Let po be such that Eq. (19) is satisfied for P = Let qo be the conjugate 
exponent of po- It follows from conditional Holder inequality and (19) that 

%,j(sup(l + \\XA\r‘^\SSs)) < KcEq^a sup (1 + 

s'<u s<s'<t 

< KcEqu^ ( sup (1 + I |Xy 1 1)"'®'® 1^,) ^ (23) 

s<s'<t 



for all j > 1. The first assertion of 1 of the proposition follows then from the 
Eqs. (22) and (23) and the inequality £’g«^(sup^<^<^(l + | 1 1)^)) < oo for all 

k>2( [16], Chap. 2 Sect. 5). 

The second asssertion of 1. of the proposition follows from Eq. (22) and then 
from Eq. (23) applied with equal to the trivial sigma algebra. 

2. Assume that g satisfies the the growth condition (GC2). Thus 

t 

/(I + 

The result follows then from the BMO condition. 

3. The case g non negative is trivial. 

• The cocycle condition (21) follows easily from the definition (20) and the above 
integrability. 

• We prove now that the penalty a is local on ^r,co{2\). Let JI,ve L(A). The 

probability measures and are equivalent to 2^^. Let r < s < t 

and A be -measurable. Assume that for all X in (X \ ^s) 1a = 

EQa,av(X\^s)^A- It follows from the equality J^^^Ia = J(^1a and the ^ x 
^(JR^) measurability of ^ and v that = ^\]s,t[^A^ Qr,o) (20) 

we get oisj(QrlT)U = oisjiQrlS’^U- Thus the penalty a is local on ,Jr,®(^)- 

• The last point follows easily from the definition of the penalty. □ 
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6 Time Consistent Dynamic Risk Measures Associated 
to Path-dependent Martingale Problems 

We change the sign in the classical definition of risk measures in order to avoid the 
minus sign which appears in the time consistency property for usual dynamic risk 
measures. In fact pst{—X) are “usual” dynamic risk measures. 



6.1 Normalized Time-Consistent Convex Dynamic Risk 
Measures 



Proposition 4 Let the stable set of probability measures defined in 

Definition 10. Assume that g is non negative, and that for all (u, co'), g(u, co' ,0) = 0. 
Let r < s < t. The formula 



Ps,t 



(7) = eSSSUpQa,a^^J^_^Q(EQa 



Q(EQa,a.(Y|^s)-«s,t(Qr;r)) 



(24) 



where cis,tiQr%^) Is given by Eq. (20) defines a normalized time consistent convex 
dynamic risk measure on L^(T2 , ^). 

For given 0 <r <t and Y in L^{Q^ , Q^y), the process {Y))r<s<t admits 

a cddldg version. 

Proof Notice that for all bounded 7, 



-ll^'lloo < EQa^^{Y\^s) < e Jr . 



Thus for all r < A < 7 \\Ps'f(Y)\\oo < 1 1 I loo- The first statement follows then 
from Definition 10, from Propositions 2 and 3 and from Theorem 4.4 of [5]. 

The proof of the regularity of paths which was given in [6] Theorem 3 for normalized 
convex dynamic risk measures time consistent for stopping times can be extended 
to normalized convex dynamic risk measures time consistent for deterministic 
times. □ 



We have the following extension of the dynamic risk measure to random variables 
essentially bounded from below: 

Corollary 2 The definition of pl'^(Y) can be extended to random variables Y (^t)~ 
measurable which are only essentially bounded from below. 

Ps]T OP) = lim^^oo a n). For every Y essentially bounded from below, the 

process (p^’^(Y)) is optional. 

Proof Let Y be ^^^-measurable and ^-essentially bounded from below, Y is the 
increasing limit of = Y An di^n tends to oo. Define as the increasing limit 

of p^’f'(F„). As we already know that for given a and t, p^’f'(F) defined on bounded 
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random variables by formula (24) is continuous from below, the extended definition 
coincides with the previous one on ^-essentially bounded random variables. 
From Proposition 4 for every n one can choose a cMlag version of the process 
Thus the map (i", a;) ^ p^’f*(F) = limp^’f’(y^) is measurable for the 
optional sigma algebra. □ 



6.2 General Time-Consistent Convex Dynamic Risk Measures 



In this section the function g (and thus the penalty) is not assumed to be non negative. 

Definition 15 Let g be a probability measure on (^2, {^t)teR+)- The multivalued 
mapping A is BMO(Q) if there is a map 0 G BMO(Q) such that 

W(u,co), suplllyll, y e A(u, co)} < (p(u, co) 

In the following, po is chosen such that the reverse Holder inequality (19) is satisfied 
for ( 20 . 

Theorem 3 Let (r, co). Assume that the multivalued set A is BMO(Q^ ^). Let 
be the stable set of probability measures defined in Definition 10. 

Let r < s < t. Let 



Ps,t 



(7) = esssupQa,a^gj^^Q(EQw(Y|^s) - «s,t(Qr;^'^)) (25) 



where cXs,t(Qr’,M^) is given by Eq. (20) 

• Assume that g satisfies the growth condition (GCl ). The above Eq. (25) defines a 

dynamic risk measure (p^’f*) on Lp(Q^ for all qo < p < oo, (where q^ is 

the conjugate exponent ofpo chosen as above). These dynamic risk measures are 
time consistent for stopping times taking a finite number of values. 

• Assume that g satisfies the growth condition ( GC2 ). The above Eq. (25) defines 

a dynamic risk measure (p^’f*) on i^t)) y ond also on every Lp(Q^^^, 

i^t)) for qo < p < oo. These dynamic risk measures are time consistent for 
stopping times taking a finite number of values. 

Proof There is a constant C > 0 such that for all gr e (21) , \ \api\ \bmo{Q^ a — 

C. It follows from the reverse Holder inequality (19), that for all non negative measur- 
able F, EQapaj,(Y\3Ss) < Kc{EQa^(\\Y\\^^\3Ss)^ Y .Egw(F|^,) defines 

a linear continuous map on Lp(Q^ with values Lp(Q^ for all qo < p < oo, and 
that for given p, the norms of these linear maps are uniformly bounded for A valued. 
From Proposition 3, it follows then that Eq. (25) defines a dynamic risk measure 
(Ps'f) on Lp(Q^ for ail qo < p < oo in case (GCl). Under assumption 
(GC2), equation (25) defines a dynamic risk measure (ps’f) on Lp(Q^^^, (^t)) for 
all Lp, qa < p < oo. 
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The time consistency for stopping times taking a finite number of values follows 
from the stability property of the set of probability measures as well as the cocycle 
and local property of the penalties (cf. [5] in Loo case). The proof is the same in Lp 
case. 



7 Strong Feller Property 



7.1 Feller Property for Continuous Parameters 



We assume that the progressively measurable function g is a Caratheodory function 
on 7R+ X ^2 X that is for all u, {cd, x) g{u, co, x) is continuous. The support 
of Qrlco is contained in the paths co' continuous on [r, oo[ (Proposition 1) and which 
coincide with a; on [0, r] . It follows that for every function X progressively continuous 
bounded, and all u > r, the function co' g(u, co' , X{u, co')) is continuous on the 
support of • We prove then the following Feller property for the penalty. 

Proposition 5 Let a be progressively continuous bounded such that a(s, co) is invert- 
ible for all {s,co). Let X be progressively continuous bounded. Assume that g is a real 
valued Caratheodory function satisfying the growth condition (GCl ) or (GC2). 

1. There is a strictly progressive real valued map L{g) on[0, t]xT2 x UC continuous 

on 1(5’, x), CO = CO such that 

E^a,ak / g{u, co' , X{u, co')du) = L{g){s, co,x) Vs < t CO and x (26) 

Us,0)*sX J ^ 

2. For all 0 < r < s < t, and co e Q, there is a Q^’^-null set N such that for all 
ooi e 



,,ax( [ g(u, co', X(u, co'))du\^s){cjo\) = L{g){s, coi,Xs(coi)) (27) 
J s 



Proof We only need to prove 1 . 

Step 1: Assume that the function g is bounded. Then g(u, co', X(u, co'))du is a 
continuous bounded function of co' on the support of Qf^. The map X being pro- 
gressively continuous bounded, the continuity property for L(g) follows easily from 
Theorem 1. 

Step 2: general case. Notice that X is bounded. 

Thus under assumption (GC2), g{u, co' , X{u, co') is uniformly bounded and the con- 
tinuity property of L(g) follows from step 1. 
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Under assumption (GCl), let (sn, cOn, Xn), con = cOn Xn with limit {s , co , x) , co = 
0 ) X. By definition the sequence cOn Xn has limit o) x. It follows from [3] that 
the set of probability measures ^ = {QHtcon*sn=c„' « e TV} U is weakly 

relatively compact and thus tight. Thus for all ^ > 0, there is a compact set such 
that < r] for all Q e From the growth condition (GCl), the existence 

of a uniform bound for Eq [ + sup^<^ | \Xu\\^)du]^ fovQe^ and the Holder 
inequality, it follows that there is a progressively continuous bounded function g\ 
such that for all Q in 

Eq I (\\g(u, CO, (ak)(u, co) — gi(u, co)\\))du < s. (28) 

The result follows then from step 1 . □ 

We introduce now a class of measurable functions on Q satisfying a continuity 
condition derived from the progressive continuity condition that we have introduced 
for progressive functions and from the continuity property proved in Theorem 1 . 

Definition 16 Let ^ > 0. The function h defined on belongs to % if there is a 
function hon x IR^ such that 

• h(co) = h(co Xt(co)) 

• h(co, x) = h(co\ x) if co(u) = co'(u) Wu < t 

and such that h is continuous bounded on {(a;, x), co = co^t^] C -G x IR^ 

Corollary 3 Assume that a is progressively continuous bounded and that a{s,(j)) 
is invertible for all {s, co). Let X be progressively continuous bounded. Let h G 
Asssume that the penalty as^t is given by Eq. (20) for some Caratheodory function g 
on lR-\- X X IR^ satifying the growth condition ( GCl ) or ( GC2 ). There is a strictly 
progressive map L^’^(h) continuous {(^/, a; x, x), Q <u <t] such that for all 
0 < ro < r < t, 

L^^^(h)(t, CO, x) = h(co, x) 

L‘^’\h)(r, CO, X) = E a.ax (h) - 

'^r,co*rX ’ ' 

= (h\^r) - arAQZk^](<^ V a.s. (29) 

Proof The result follows from Theorem 1 and from Propositions. □ 



7.2 Feller Property for the Dynamic Risk Measure 

Proposition 6 Let pc in L(A) (Definition 11). Let t > 0 and h e %. Asssume that the 
penalty as^ is given by Eq.(20)for some Caratheodory function g on 7R+ x T2 x HC 
satifying the growth condition ( GCl ) or ( GC2 ). There is a strictly progressive map 
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^ix{h) continuous on {{u, (d,x), co = co u < t] such that 0f^{h){t, co,x) = 
h(co, x)for all co, and such that for all r < s < t, there is a process Vs in L(A) such 
that 



o>, X) = EQa,^{h) - 



- a. 



$,CO*sX 



<l>n(h)(s, a, Q^r,coo 



and 



(30) 

(31) 



Q“Mh\^s) - C^sAQ^r’Z) ^ - ^sAQf‘r:Z) QZo 



(32) 



Proof The proof is done in two steps. The first step is the construction of 0^{h) 
given /X. The second step is the construction of Vg given /x and 5’. /x belongs to L(A). 
From Definition 11, let 0 = < •• < si < • • < Sk < Sk-\-i = oo be a finite 

partition such that Eq. (13) is satisfied. 

• First step: construction of 0^{h). We construct 0^{h) recursively on [si, 

Let Sn be such that Sn < t < ^^+1- From Corollary 3, for all j e In there is a 
strictly progressive map continuous on {{u, co,x), co = co u < t] 

satisfying Eq. (29). Let 

0^(h)(s, CO, x) = sup (h)(s, co, x) Vs e [Sn, t[ (33) 

j^In 



Let hn(co) — 0^(h)(sn, co,Xs^(co)). The function hn belongs to ^sn 

hn(co, x) = 0^{h)(sn, CO, x). Then we can proceed on , Sn[ with hn. We con- 
struct recursively the strictly progressive map 0^ (h) continuous on {(u, co, x), co = 
CO X, u < t}. Notice that for all 5' G [si, there are measurable sets 
(Csj)jeii, such that 

d>A^)C, 0 },x) = ^ \QA^)C'^‘Ahi+\){s, (a, x) (34) 

j^ii 



• Second step: Given 5’ g [r,t], construction of the process v^. 

There is a unique k such that 5’ e]sk, ^y^+i]. For i > k for all u gI^s-/, 5’/+i], define 

Vs(u, «) = ^ lQ.j{co)Xij{u, co) (35) 

j<^h 



And for u Gi^s-, 5'y^+i], define 

Vs(u, O)) = ^ lQj(co)kkj(u, co) 
jelk 



( 36 ) 
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and Vs(u,co) = /x(w, co) for slU 0 < u < s. It follows from the construction of Vs 
that the process Vs belongs to L(A). It follows also recursively that for all i > k 
and 0 ) e k2, 



EQa.^{h) - 



(37) 



and for all &> e Q, 



%:r x,{w)) 

By construction the following inequality is satisfied: 

EQa.aj.Ql) - ^ O), 

It follows then from Proposition 2 that 



EQaMh\j^,) - asAQZo') ^ %,C (^ 1 =^^) - ^sAQr’Z^) Q" 



r,coo 



(38) 



(39) 



and 












Theorem 4 Assume that the hypothesis of Theorem 3 are satisfied and that g 
is Caratheodary function. The time consistent dynamic risk measure (Ps’f)r<s<t 
defined on V^{T2 , co) Eq- (25) satisfies the following Feller property: For 

every function h G %, there is a progressive map R(li) on IRj^yiT2,R(h){t,(j()) = h(o)), 
such thatR(h) is lower semi continuous on {(w, a;, x), u < t, co = co such 

that the following equation is satisfied 

^s G [r, r], ^co' pff{h) = R{h){s, co') (40) 

VO < r < 5’ < ^, pI'^^ ( h){co') = R(h){s, co' , co' (s)) Qfi ^ a.s. (41) 

(R(h) denotes the strictly progressive map on 7R+ x T2 x IR^ associated to R(h) in 
the one to one corrrespondence introduced in Sect. 2 ). 

Proof For all pc G L{A), let d>^(/z) be the strictly progressive map, continuous 
on {(u,co,x), CO = CO u < t} constructed in Proposition 6. Let R{h) = 
^^P/x€L(yi) ^/x(^) -The function /?(/z) is then lower semi continuous on {(w, co,x), co = 
co^u^, M < t}. hQtR(h)(u, co) = R(h)(u, co,Xu(co)). Equations (40) and (41) follow 
easily from Proposition 6. 
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8 Existence of Viscosity Solutions for Path-dependent PDEs 
8.1 Existence of Viscosity Supersolutions 

Recall that A is a. closed convex lower hemicontinuous multivalued mapping from 
X into IR^ (Sect. 4.2). Let/ be the convex conjugate of g defined as 

f(u, 0 ), z) = sup (z*y - g(u, 01 , y)) (42) 

yeA{u,(i>) 

We prove now that for all h e % the map R(h) of Theorem 4 leads to viscosity 
solutions for the following semi-linear second order PDE. 

— duv(u, co) — Afv(u, co) —f{u, CO, a(u, co)DxV(u, co)) = 0 

v{t,co) =f(co) (43) 
Afv(u, co) — ^Tr{a{u, co)D^{y)(u, co)) 

Theorem 5 Fix (to, coq). Assume that the mutivalued set A is BMO(Qf^ ^^) (Def- 
inition 15). Assume that the function g satisfies the preceding hypothesis (g is 
a Caratheodory function and satisfies the growth condition ( GCl ) or ( GC2 ) ). 
Assume furthermore that g is upper semicontinuous on {(v, co, y), (s, co) e X, ye 
A(s, CO, o;(v)}. For all r and co, let (p^’f*) be the dynamic risk measure given by 
Eq. (24) where the penalty satisfies Eq. (20). 

Let h e %. The function R(h) is progressive and R(h) is lower semi continuous on 
{(u, CO, x), CO = CO u < t] (Theorem 4). R(h) is a viscosity supersolution of the 
path-dependent second order partial differential equation (43) at each point (to, coq) 
such that f (to, coo, a(to, coo)z) is finite for all z. 

Proof Let xo = coo (to)- From Theorem 4, the function R(h) is progressive and R(h) 

is lower semi continuous on {(u, co, x), co = co^u^^ u < t}. We prove first that R(h) 

is bounded from below on some De(^o, coo). R(h)(u, <^) > co^^) ~ ^ut(Qu cc) 

all given k > 2, £’ga^(sup^<^<^(l -h ||X^||)^)) is uniformly bounded for (u, co) e 

Dg(to, coo). The result follows then from either the (GCl) condition or the (GC2). 

— 10 2 

Let 0 progressive, 0 in X X IR^) such that c/)(to, coo) = ^(^)(^o, <^o) 

and (^ 0 , coo) is a local minimizer of R(h) — 0 on Dy^fo, coo) for some q > 0. 

• Step 1 : Continuity properties 

By hypothesis /(^o, coo, ci(to, coo)DxCp(to, coo, -^o)) < c>o. Thus for all e > 0, there 
is A .0 G A(to, coo) such that 



DxCp(to, coo,xofa(to, a)o)A-o - g(to^ ^0, ^o) >/(to, ^^0, ci(to, coo)Dx(t)(to, ^^0,-^0)) “ ^ 

(44) 

The multivalued set A is assumed to be lower hemicontinuous. It follows that 
for all ^ > 0, Ax is also lower hemicontinuous, where Ax(n,co,x) = {z e 
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A(u,co,x), ||z|| <^}. Choose^ > ||Ao||. From Theorem 2, there is thus a contin- 
uous bounded selector X(u, co, x) of Ak defined onX such that k(to, coq, xq) = Xq- 
From the upper semi-continuity condition satisfied by g, and the continuity of the 
map X, it follows that for all e > 0, there is t]\ >0, such that for to < u < t < 
to + rii,d(co,ct)o) < m and \\(jo{t) -xo|| < m, 

g(t, CO, k(t, CO, &>(?))) - g(to, coo, ^o) < e ( 45 ) 

From the continuity of the function X, the hypothesis 0 G ’ ’ and the progres- 
sive continuity of a, there is r]2 such that for S u < t < to ^2, I k — -^o 1 1 < ^2 
and d{co <^o) < ^2, 



— i 2 — _ — t — — 

\du<p(u, co,x) -\- - Trace (D^(pa(u, co, x)) -h ciX)(u, co,x) — 

- 1 2-- -t- 

du(p{to, coo, ->^ 0 ) + -TraceiD^4>a){to, coo, -^o)) + Dx<p a(to, coo, ->ro)^ol < « (46) 

The maps a and X are bounded. It follows from [22] that there is 0 < a < 
infirj, rji, t] 2 ) such that 

A = {co\ sup ||m(M) - cuo(to)ll > infill, rn, r] 2 ) } 

(47) 

Let C = {co, co(u) = coo(u) VO < u < to, sup^o<u<ro+a “ t^o(fo)ll < 

infiri, rn, r] 2 )} 

• Step 2: Time consistency 

For all 0 < ^ < Ck(, let 5 be the stopping time 8 = The stopping time 8 takes 
only 2 different values. By definition of the probability measure it follows 

from (47) that Q'^^^iC) > (1 - e). 

The dynamic risk measure (Pu,'^^)o<u<v, is time consistent for stopping times 
taking a finite number of values, thus 

= (48) 

From Theorem 4, the lower semi continuous function R(h) satisfies: 

(h)(co) = R(h)(to + S,co,X,,+si 0 })) a.s„ and pl^lT^ih) = R{h){to, coo). 

Let X be the continuous function defined in step 1. By hypothesis R(h) > 0 on 
Dyj(to, coo). For all a; G C and u < p, (u,co co(u)) G Dyj(to, coo). Since the 
functions R(h) and 0 are progressive it follows that R(h) > 0 on [^o, to+ C. 
It follows then from the equality 0(^0, coo) = coo) and from the definition 

of that 



f 

4>{to,coo) > E a,ak [<t>{to + 8, CO, X,^+s(to)) - / 

Jtn 



to -\-8 



to 



g(u, CO, X(u, CO, Xu(co)))du] 

( 49 ) 
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• Step 3: Martingale problem 

The probability measure is solution to the martingale problem for 

starting from coq at time ^o- The function 0 is strictly progressive and belongs to 
10 2 

X Q X IR^). It follows from the martingale property proved in [3] that 

r^o+s _ I 

0 > E^a.ax [ / idu4>(u, CO, Xu(co)) + -Trace((Dl4)a)(u, co, Xu(co))))du] 

Jto 2 

rto~\-8 ^ _ _ 

+ E^a,ax [ / (Dx4> ak)(u, CO, Xu(co)) — g(u, CO, k(u, CO, Xu(co)))du] (50) 

JtQ 

• Step 4: Conclusion 

Divide (50) by ^ and let ^ tend to 0. It follows from the definitions of C and 3 and 
the Eqs. (44)-(46) proved in step 1 and 2^0 ^q(C) > \ — s that 

-du(t){to,coo,xo) - ^(l){to,coo,xo) - f{to, coq, cj\to, coo)Dx(p(to, coq, xq)) > -2e(l-6:) 

This gives the result. □ 



8.2 Existence of Viscosity Subsolutions 

In this section we will assume that the set A has some uniform BMO property. 

Definition 17 The multivalued mapping A is uniformly BMO with respect to a if 
there is a non negative progressively measurable map cp and C > 0 such that for all 



and such that for all {s' , co'), the unique solution to the martingale problem 
starting from co' at time satisfies: 



Of course the above condition is satisfied if sup^ cp{s, co)^ds < oo. 

The name “uniform BMO” property is justifed by the following result. 

Lemma 2 Assume that the multivalued mapping A is uniformly BMO with respect 
to a. Then for all {r, co) and all process pc A valued such that JI is ^ x ^(IR^) 
measurable, pc belongs to BMO {Q^ ^) and \ \pi\\BMO{Qf j S C 

Proof Let r, co and a stopping time r > r. It follows from [22] and from uniqueness 
of the solution to the martingale problem for starting from co' at time r {co') that 



sup{|A.|, k G A{s, co)} < cp{s, co) 



(51) 




(52) 
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for 2^ ^ almost all co' , 

/ OO rOO 

(p{u, 0 ))^ du\^T;){a)') = Eqa^ (J (p{u, co)^du) < C □ 

Let h e %. The function R(h) is lower semi continuous in viscosity sense but it is not 
necessarily upper semi continuous. Therefore we need to introduce the upper semi 
continuous envelope of R(h) in the viscosity sense according to Sect. 2.3. Denote it 
R(hT. 

R(h)*(s, co) = limsup{R(h)(s' , co'), (s , co') e Dr^{s, co)} 

r ]—^0 



Theorem 6 Let (to, coq) be given. Assume that the mutivalued set A is uniformly 
BMO with respect to a. Assume that the function g is a Caratheodory function sat- 
isfying the growth condition (GCl ) or (GC2) and that the Fenchel transform f ofg 
is progressively continuous. 

Let h G %. The map R(h)* is progressive, R(h)* is upper semicontinuous in viscosity 
sense. R(hY is a viscosity subsolution of the path-dependent second order partial 
differential equation (43). 



Proof Let xq = coo(to)- The progressivity of R(h)* follows from the equality 
Dn(s,co) = Dy^(s, CO ^(s))> The upper semicontinuity property follows from 
the definition of R(hY . We prove first that R(h) is bounded on some Dg(to, coq). 
R(h)(u, co) = sup^(£ga.«M(/!) - au,t{Qu,T))- Forgiven k>2, £:ga^(sup^<„<r(l + 
I l^f 1 1)^)) is uniformly bounded for (u, co) e £>«. The result follows then from either 
the (GCl) condition or the (GC2), and from the uniform BMO hypothesis with 
similar arguments as in the proof of Proposition 3. 

Let 0 progressive, 0 g ’^(^+ x ^2 x IR^) such that (p(to, coq) = R(h)*(to, coq) 
and such that (^o, <^o) is a maximizer of R(h)* — 0 on Dr^(to, coq). 



• Step 1: Making use of the progressive continuity property of a, f and of the 
regularity of 0, for all n e N*, there is > 0, ^ such that for 

to <U < to T]n, d(cOo, co) < Y]n and ||vo co(u)\\ < 



f(u, CO, (aDjc(j))(u, CO, co(u))) <f(to, coo, (aDjc(j))(to, + - (53) 

n 



i ry _ 

\du<p(u, CO, co(u)) + -Trace(D^(j)a)(u, co, co(u))) — 

1 2 -- 1 
dufito. m, -^o) + -Trace(D^(j)a)(to, m, -^o))l < - (54) 

2 n 

Without loss of generality one can assume < h- 

The matrix valued process a being bounded, it follows from [22] that there is hn 
such that for oil to < s < to + q and co G Dj^fto, coo). 
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sup \\w\u) - o)(5)|| > y}) < e (55) 

S<U<S+hn ^ 



Without loss of generality one can assume that hn < 

For all ^ > 0 choose {tn, con) ^ Dm (to, coq) such that lim^^oo = 
R(h)*(to, cjoo) and 

0fe, COn) < R(h)(tn, 0)n) + ~ (56) 

n 

Let C„ = (ft)', ^^Vtn<u<tn+K - o^nitn)\\ < Y, £u'(v) = cy„(v), Vv < f„}. 

It follows from Eq. (55) that > 1 — £ for all n. From Lemma 2 there is a 

constant C' > 0 such that | \ati\\BMO{Q^ S C' for all /jl g L(A) and all n. From 
[ 1 5] it follows that there is po such that the reverse Holder inequality ( 1 9) is satisfied 

for all (o(ap.) and thus that for all /x G L(A), and all n, 2^J,^(Q) < Kc(s)^o 
where qo is the conjugate exponent of po. The constants qo and Kc depend neither 
on pi nor on n. Thus s can be chosen such that (Q) > 5 for all n and all 
/X G L(A). 

Let be the stopping time taking only two values hn and 0 defined by = hnlcn- 
• Step 2: Time consistency 

Making use of the time consistency of the risk measure (pu'^''), and of the fol- 
lowing equations deduced from Theorem 4, 



Rmtn,(an)=ptP(h) 

it follows that 

R(h)(tn, OJn) = Pl::^;^s^^(h)(tn + .)) 

It follows from the definition of the risk measure ), that for all n there is 

^^tn,tn+On^^ 

a process /x„ in L(A) such that 

R{h){tn, «„) < {R{h){tn + Bn, •)) ~ «r„,r„+5„ + ~ (57) 

^tn,(On n, n 

For all co' G C„, (tn -\-hn,(jo' + hn)) G Dr^(to, coq). The functions R(h) 

and 0 are progressive and satisfy R(h) < R(h)* < 0 on Dr^(to, coq). It follows that 

^(h)(tn + ^') ^ 0(^n + ^') ^ (58) 

From Eqs. (56)-(58), it follows that 



(/)(tn, (On) — E Qa,aiPn\(f) (tn ~\~ Sn, •) 




g(u, CD , pin(u, Co'))du] 



hn 

-h 2— 



n 



(59) 
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• Step 3: Martingale problem 

Given (tn, con), the probability measure solution to the martingale prob- 
lem starting from con at time tn. The strictly progressive function 0 belongs 

to It follows from [3] and from Eq. (59) that 



rtn+^n ^ 

0 < EQa,ann [ / {du4>{u, oJ , {(jL>\u))) -\~ -Trace(Dl(l)a)(u, co' , Ci>'(u)))du] 

^tn,(i>n 2 

rtn+^n ^ 

+ E(^a,atin [ / {D^cj) (u, co' , co' {u))a{u, co')iin{u, co') ~ g(u, co' , /x„(w, co')))du] + 2 — 

^tn,a>n 

(60) 



By definition off it follows that 



ptn~\~8n 

[/ 

dtn 



0 < EQa,aixn [ I (9^0 (w, co' , co' (u))) H — Trace{{Dl(l)a){u, co' , co'(u)))du] 

^tn.COfi I / 



+ Er 



a,aixn [ / 
tn,o)n E 
^ Cn 



f(u, co', (dDx<p){u, co' , co' {u)))du\ -h 2 — 



( 61 ) 



• Step 4: Conclusion 

Divide equation (61) by hn and let n tend to oo. The result follows from step 1, 
the inequality > \ for all n and 8n = hnlcn' ^ 



8.3 Existence of Viscosity Solutions on the Set 
of Continuous Paths 



On the set of continuous paths ^ (7R+ x IR^) we consider the uniform norm topology. 
In this section we assume that the function a is only defined on 7R+ x "^(7R+ x IR^) 
and that it is continuous. For every continuous function h on the space of continuous 
paths ^(lR-\- X IR^) such that h(co) = h(co') if coin) = co'{u) for all u < t, the 
corresponding function R(h) is constructed as above. In this case the function R(h) is 
defined only on the set of continuous paths (more precisely on [0, t]x^ (7R+ x IR^)). 
We make use of the definition of viscosity solution on the set of continuous paths 
introduced in Sect. 2.4 (Definition 4). To prove that R(h) is a viscosity supersolution, 
to define R(h)* and prove that it is a viscosity subsolution, we do not need to extend 
the functions a nor R(h). Indeed as the support of the probability measure ^ is 
contained in the set of continuous paths, we just need to use the restrictions of the 
function 0 and of its partial derivatives to the set 7R+ x "^(7R+ x IR^) x IR^. The 
proofs given for Theorems 5 and 6 can be easily adapted to prove the analog result 
in the setting of continuous paths. 

Thus the setting of continuous paths can be considered as a “particular case” of the 
setting of cMlag paths. 
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9 Conclusion and Perspectives 



We have introduced new notions of regular solutions and viscosity solutions for path- 
dependent second order PDEs, Eq. (1), in the setting of cadlag paths. In line with the 
recent literature on the topic, a solution of (1) must be searched among progressive 
functions, that is path-dependent functions depending at time t on all the path up to 
time t. However, the notions of solutions introduced in the present paper differ from 
previous notions introduced in the literature on two major points: 

• In contrast with other papers, we consider on the set Q of cadlag paths the Sko- 
rokhod topology. ^2 is thus a Polish space. This property is fundamental for the 
construction of solutions for path-dependent PDE that we give in the present paper. 

• The notions of partial derivative for progressive functions that we introduce are 
defined in a very natural way by considering a progressive function of two variables 
as a function of three variables. 

In addition we introduce also a notion of viscosity solution on the set of continuous 
paths. 

Making use of the martingale problem approach to path-dependent diffusion 
processes, we then construct time consistent dynamic risk measures The stable 
set of probability measures used for the construction of is a set generated by 
probability measures solution to the path-dependent martingale problem for 
starting form oo at time r. The path-dependent progressively continuous bounded 
function a is given and takes values in the set of invertible non negative matrices. 
The path-dependent functions /x are progressively continuous and vary accordingly 
to a multivalued mapping A. This construction is done in a very general setting. In 
particular the coefficients /x are not uniformly bounded. We just assume that they 
satisfy some uniform BMO condition. To construct the penalties, we make use of a 
path-dependent function g satisfying some polynomial growth condition with respect 
to the path and some L 2 condition with respect to the process /x related to the BMO 
condition. In contrast with the usual setting of BSDE, in all this construction no Lip- 
schitz hypothesis are assumed. Notice however that the Lipschitz setting can also be 
studied within our approach: instead of starting with progressively continuous maps 
a and /x, one could start from a subfamily of maps which, for example, satisfy some 
uniform continuity condition (as K Lipschitz maps). 

We show that these risk measures provide explicit solutions for semi-linear path- 
dependent PDEs (2). Eirst, we prove that the risk measures satisfy the following 
Eeller property. Eor every function h measurable having some continuity property, 

there is a progressively lower semi-continuous function R{h) such that ph^{h{Xt)) = 
R{h)(r, 00 ) and R(h)(t, co) = h(t,co). Next, the function R(h) is proved to be a 
viscosity supersolution for a semi-linear path-dependent PDE (2), where the function 
/ itself is associated to g by duality on the multivalued mapping A. We prove also 
that the upper semi continuous envelope of R(h) is a viscosity subsolution for the 
path-dependent semi linear second order PDE (2). 

Here we have proved the progressive lower semi continuity for R(h). To prove the 
progressive continuity property, additional hypothesis should be added, e.g. Lipschitz 
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conditions. Another way of proving the continuity is to apply a comparison Theorem. 
The study of comparison theorems and of continuity properties in this setting, as well 
as the study of solutions to fully non linear path-dependent PDE will be the subject 
of future work. 
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Appendix 

An important way of constructing time consistent dynamic risk measures is to con- 
struct a stable set of equivalent probability measures and to define on this set a penalty 
which is local and satisfies the cocycle condition [5]. Recall the following definitions 

Definition 18 A set ^ of equivalent probability measures on a filtered probability 
space (^2, (^t)) is stable if it satisfies the two following properties: 

1 . Stability by composition 

For all 5' > 0 for all Q and R in there is a probability measure S in ^ such 
that for all X bounded ^-measurable, 

Es(X) = Eq(Er(X\^,)) 

2. Stability by bifurcation 

For all 5' > 0, for all Q and R in for all A e there is a probability measure 
S in ^ such that for all X bounded ^-measurable. 



Es(X\^s) = IaEq{X\^s) + IacEr(X\^s) 



Definition 19 A penalty function a defined on a stable set ^ of probability measures 
all equivalent is a family of maps (as,t), s < t, defined on ^ with values in the set 
of -measurable maps such that 

(i) a is local: 

For all Q,R in <^, for all 5', for all A in the assertion IaEq{X\^s) = 
IaEr(X\^s) for all X bounded measurable implies that lA0is,t(Q) = 
Uas^tiR)- 
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(ii) a satisfies the cocycle condition: For 2i\\r < s <t, for all Q in 

<^r,r(2) ~ ^r,s(.Q) H” r) 

Recall the following result from [5]. 

Proposition 7 Given a stable set ^ of probability measures and a penalty (cts^t) 
defined on ^ satisfying the local property and the cocycle condition, 

Pst{X) = esssupQ^^(EQ(X|^s) - «st(Q)) 

defines a time consistent dynamic risk measure on {^t)) or on Lp(k2 , 

(^t)) if the corresponding integrability conditions are satisfied. 
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Pricing CoCos with a Market Trigger 

Jose Manuel Corcuera and Arturo Valdivia 



Abstract Contingent Convertible Bonds, or CoCos, are contingent capital instru- 
ments which are converted into shares, or may suffer a principal write-down, if certain 
trigger event occurs. In this paper we discuss some approaches to the problem of pric- 
ing CoCos when its conversion and the other relevant credit events are triggered by 
the issuer’s share price. We introduce a new model of partial information which aims 
at enhancing the market trigger approach while remaining analytically tractable. We 
address also CoCos having the additional feature of being callable by the issuer at a 
series of pre-defined dates. These callable CoCos are thus exposed to a new source 
of risk — referred to as extension risk — since they have no fixed maturity, and the 
repayment of the principal may take place at the issuer’s convenience. 

Keywords Contingent convertible • Coco bond • Callable bond • Extension risk 



1 Introduction 



The Basel Committee on banking Supervision was created in 1974, after the collapse 
of the German Bank Herstatt, with the aim of establishing prudential rules of trading. 
During the 1980s this committee was concerned with the big moral hazard of Japanese 
banks that distorted the competition among countries. In 1988 it formulated a set of 
rules, so called Basel I, to stabilize the international banking system. Basically the 
main rule was that each bank should hold a minimum of 8 % of its total assets, where, 
for the valuation of the assets it was used some weights reflecting the credit risk of 
each asset. These measures produced a credit crunch and some criticism appeared, 
mainly related with the weights used to measure the risk of the different assets. 
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These weights only took into account the kind of institution borrowing or issuing the 
security and not what the spreads observed in the market. To amend the first Basel 
accords and take into account the market risk and interest-rate risk, it was started a 
process that concluded in 2004 with new rules, Basel II. These agreements are more 
complex and consider not only new rules for capitalization, with the introduction 
of VaR methodology, but also supervision and transparency rules. The regulator 
calculated the weights on the basis of the formula 



where 0 is the CDF of the standard normal distribution, LGD is the loss in case 
of default, PD is the probability of default, and R is the correlation between the 
portfolio of loans and a macroeconomic risk factor, see [24] for an explanation of its 
underlying model. To determine the different parameters, banks were allowed to use 
their own models. 

In 2007 a financial crisis, originated in the U.S. home loans market, quickly spread 
to other markets, sectors and countries, forcing the Federal Reserve and the European 
Central Bank to intervene in response to the collapse of the interbank market. This 
gave rise, in 2010, to new regulation rules, known as Basel III, that would change 
the financial landscape. Some securities were not going to be allowed anymore as 
regulatory capital and supervisors put emphasis in the fact that capital regulatory 
should have a real loss absorbing capacity. This is when Contingent Convertibles 
{CoCos) started to play an important role. 

In 2002 Flannery proposed and early form of CoCo that he called Reverse Con- 
vertible Debentures, see [22]. The idea was that whenever the bank issuing such 
debentures reaches a market-based capital ratio that is below a pre-specified level, a 
sufficient number of said debentures would convert into shares at the current market 
price. Later, in [23], he updated the proposal and named these assets as Contingent 
Capital Certificates. The idea behind was in agreement with what [19] wrote in 
The Prudential Regulation of Banks. In this work they formulated the representation 
hypothesis. According to this hypothesis prudential regulation should aim at replicat- 
ing the corporate governance of non-financial firms, that is, acting as a representative 
of the debtholders of bank, regulation should play the role of creditors in nonfinancial 
institutions. 

A Contingent Convertible is a bond issued by a financial institution where, upon 
the appearance of a trigger event, related with a distress of the institution, either an 
automatic conversion into a predetermined number of shares takes place or a partial 
write-down of the bond’s face value is applied. It is intended to be a loss absorbing 
security in the sense that in case of liquidity difficulties it produces a recapitalization 
of the entity. 

Basel III, among other regulating measures, proposed the inclusion of CoCos as 
part of Additional Tier 1 Capital, where Tier 1 is, roughly speaking, the capital or the 
assets that the entity have, for sure, in case of crisis, and consists of Common Equity 



K = LGD X 0 




( 1 ) 
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Tier 1 and Additional Tier 1. Chan and van Wijnbergen [5] affirm that the inclusion 
of Cocos in Tier 1 is a likely factor in the increase of C6>C6> issuances. In December 
2013, the C(9C(9 market had reached $49 bn in size in Europe. 

It is a controversial issue if CoCos are a stabilizing security. Koziol and Lawrenz 
[28] show that, under certain modelling assumptions, if CoCos are part of the capital 
structure of the company equity holders can take more risky strategies, trying to 
maximize the value of their shares. In their work Koziol and Lawrenz use a low 
level of the asset price of the company as a trigger for the conversion. Chan and 
van Wijnbergen [5] point out that conversion can be seen as a negative signal by 
the depositors of a bank and to produce bank runs. They also argue that far from 
lowering the risks, CoCos can increase even the systemic risks. On the other hand 
[20] defend that CoCos is an appropriate solution that does not lead to moral hazard 
provided that conversion is tied to exogenous macroeconomic shocks. 

There is also disagreement about how to establish the trigger event. It is perhaps 
the most controversial parameter in a CoCo. Some advocate conversion based on 
book values, like the different capital ratios used in Basel III. Others defend market 
triggers like the market value of the equity. So far the CoCos issued by the private 
sector are based on accounting ratios. 

The market for contingent convertibles started in December 2009 when the Lloyds 
Banking Group launched its $13.7 bn issue of Enhanced Capital Notes. Next in line 
was Rabobank making its first entry in the market for contingent debt with a € 1 .25 bn 
issue early 2010. After this, things turned quiet until Lebruary 2011, when Credit 
Suisse launched its so-called Buffer Capital Notes ($2 bn). This Credit Suisse issue 
was done on the back of the new regulatory regime in Switzerland. This was called 
the “Swiss Linish” and it required the larger banks such as UBS and Credit Suisse 
to hold loss absorbing capital up to 19 % of their risk weighted assets, see [1 1]. This 
capital had to consist of at least 10% common equity and up to 9 % in contingent 
capital. In 2014 a number of banks issued CoCos, including Deutsche Bank and 
Mizuho Linancial Group. 

Lrom a modelling point of view and sometimes depending on the trigger chosen 
for the conversion, usually a low level of a certain index related with the asset, the 
debt or the equity of the firm, one can follow an intensity approach or a structural 
approach to model the trigger. Lor an intensity approach for modelling the conversion 
time, see for instance [8, 16]. This approach is especially useful when pricing CoCos 
is the main interest, it is a kind of statistical modelling of the trigger event. In fact 
what one models is the law of the conversion time. In the structural approach for 
modelling the trigger, one models the random variable describing the conversion 
time and one relates it with the dynamics of the assets, debt, or equities. It is a more 
explanatory approach, where one can use the observed dynamics of certain economic 
facts to describe the conversion time. 

In the structural approach one can use a market trigger based on a low level of 
the equity value. This approach is very appealing because the market value of equity 
is an observable economic variable whose dynamics can be modelled in order to fit 
historical data. At the same time it allows to obtain close pricing formulas, like in 
[13], and to define an objective trigger that can be observed immediately. Cheridito 



182 



J.M. Corcuera and A. Valdivia 



and Xu [7] also use this trigger and show that pricing and hedging problems can be 
treated for quite general continuous models and barriers and that solutions can always 
be obtained, at least numerically, using Feymann-Kac type results, translating the 
problem of pricing into a problem of solving a series of parabolic partial differential 
equations (PDE) with Dirichlet boundary conditions. 

One argument against accounting triggers is that monitoring is not continuous, 
there is always a delay in the information. Moreover, in the recent crisis these triggers 
did not provide any signal of distress in troubled banks. On the contrary, when using 
market triggers, there exist the risk of market manipulations of the equity price trying 
to force the conversion or undesirable phenomena like the death-spiral ejfect. In [17] 
authors propose a system of multiple triggers to avoid the death spiral, whereas in 
[13] a system of coupon cancellations is proposed in order to alleviate this effect. 
Sundaresan and Wang [39] analyze this kind of trigger and find that to use low equity 
values as trigger is not innocuous. It can have destabilizing effects in the firm. Their 
reasoning is roughly speaking the following. Suppose that {At)t>o represents the 
aggregate value of the assets of the company, (Pt)t>o its aggregate debt, (Q)^>o 
the aggregate value of the CoCos, and the aggregate liquidation value of the 

CoCos issued by the firm. Set r for the conversion time, and assume that it happens 
when the (aggregate) equity value, {Et)t>o is lower that some level say (i7^)^>o. Since 
equity value is the residual value of the asset, at any time t < r, we will have two 
possibilities: 

At-Dt-Ct>Htift<T ( 2 ) 

or 

At-Dt-Lt <Ht iU = T. (3) 

This gives that, at any time t <x, 

Df -\- Ct < At < Df Lf. (4) 

So, if Q > Lf there is not any possible value for At and if Q < Lt there are 
multiple values, all of these allowing, according to [39], potential price manipulation, 
market uncertainty, inefficient allocation and unreliability of conversion. Obviously 
if Ct = Lt, that is, if there is no jump on the wealth of CoCo's investors at the 
conversion time, then the equilibrium is possible, but this is considered non realistic 
and even problematic, since a punitive conversion for shareholders could help to 
maintain the market discipline. As a possible remedy to this situation [32] propose a 
trigger based not only in the value of the equities but also in the value of the CoCos. 
They also propose to include, in the CoCo contract, a PUT option of the issuer on 
the equities, in case of conversion, to avoid market price manipulations. 

Another possibility, in the structural approach, is to use a low value of the asset 
value as a trigger. It is also quite appealing, since it allows to consider the whole 
capital structure of the company, to study the effect of Co Co debt in the equity value 
and to obtain the optimal conversion barrier for the shareholders. See for instance 
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[6, 28] or [2]. In this latter paper they also consider other triggers aiming at providing 
a proxy for regulatory triggers. 

Nevertheless in all these papers authors consider a fixed maturity of the CoCo 
bond. However bonds often do not just have a legal maturity but can have also 
different call dates. In such cases, the bond can be called back by the issuer at these 
dates prior to the legal maturity. This risk of extending the life of a contract is what we 
call extension risk. This has been treated for the first time in [18] using an intensity 
approach and in [14] using a structural approach. 

In this paper we review our work on the topic of pricing CoCos, and we introduce 
new issues like delay in the information and jumps. We always consider low values of 
the stock as triggers and CoCos that convert totally into equities in case of conversion. 
The paper is organized as follows. The contract features are specified in Sect. 2. In 
addition, a model-free formula for the CoCoprice is presented in order to establish the 
general pricing problem. In Sect. 3 a model with stochastic interest rates is studied. 
A closed-form formula for the price is given and subsequently used in order to 
study the Black-Scholes model and its Greeks. In Sect. 4 two advanced models are 
discussed. On the one hand, the stochastic volatility Heston model is incorporated 
to the share price dynamics, and the correspondent prices are later on obtained by a 
PDE approach. On the other hand, an exponential Levy model is proposed, and the 
obtainment of the correspondent prices is addressed by a Fourier method exploiting 
the so-called Wiener-Hopf factorization for Levy processes. In Sect. 5 we introduce 
a new trigger model which aims to describe the delay of information present in 
accounting triggers. Finally, in Sect. 6 we show how the original pricing problem is 
modified when no fixed maturity is imposed on the CoCo. This variation leads to 
what we call CoCos with extension risk, and the pricing problem includes solving 
an optimal stopping time problem which, even in the Black-Scholes, turns out to be 
far from straightforward. 



2 The Pricing Problem 



The definition of a CoCo requires the specification of its /ace value K and maturity 
T, along with the random time r at which the CoCo conversion may occur, and 
the prefixed price Cp at which the investor may buy the shares if conversion takes 
place. We refer to r and Cp as conversion time and conversion price, respectively. 
The quantity Cr K / Cp refered to as conversion ratio. Assuming m coupons 
are attached to the CoCo, then we further need to specify a series of credit events 
that may trigger a coupon cancellation. Denote by ri, ..., the random times at 
which the aforementioned credit events may occur. Then the whole coupon structure 
(cj, Tj, of the CoCo is defined, in such a way that the amount cj is paid at 

time Tj, provided the zj > Tj. We establish that the last coupon is paid at maturity 
time, i.e.,Tfn := T. The coupon cancellation feature was introduced in [13] in order 
to alleviate the so-called death-spiral ejfect exhibited by the traditional CoCo — see 
details in Sect. 3.1. Thus it is assumed that coupon cancellation precedes conversion 
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according to ri > • • • > > r . Of course it suffices to set ri = • • • = = oo if 

this feature were to be excluded from modelling. 

It will be assumed that the issuer of the CoCo pays dividends according to a 
deterministic function /c. From the investors side, it seems to be reasonable to assume 
that no dividends are paid after the conversion time r . Thus hereafter we shall assume 
that the following condition holds true. 

Condition (F). There are no dividends after the conversion time r. 

Remark 1 It is worth mentioning that, whereas this condition simplifies the expres- 
sions obtained for the price, it is not crucial, in the sense that the computations still 
can be carried on. We refer to [13] for a further discussion on this topic. 

Once these features are settled, the Co Co's final payoff, is given by 



where (St)t>o and (rt)t>o stand for the share price and interest rate, respectively. 



2.1 A Model-Free Formula for the CoCo Price 

In Proposition 2, a model-free formula for the CoCo price is given. Let us first 
introduce some notation required for what follows. Underlying to our market, we 
shall consider a complete probability space (^2, P), endowed with a filtration 

F := (^Ore[0,r] representing the trader’s information — this includes the information 
generated by all state variables {e.g., share price, interest rates, total assets value,...) 
and the default-free market. All filtrations considered are assumed to satisfy the usual 
conditions of P-completeness and right-continuity. We shall denote the evolution of 
the money in the bank account by ke.. 



Recall that two probability measures on (^2, ^), Pi and P 2 , are said to be equiv- 
alent if, for every A e Pi (A) = 0 if and only if P 2 (A) = 0. We shall assume 
the existence of a risk-neutral probability measure P*, equivalent to the real-world 
probability measure P, such that the discounted value of self-financing portfolios, 
(Vt := ^)?e[0,r], follows a P* -martingale. Hereafter the symbol tilde will be used to 
denote discounted prices. Now, in addition to P*, we shall consider other two prob- 
ability measures (also equivalent to P) which will allow us to carry on some of the 
computations related with the CoCo arbitrage-free price. First, letting 7}))^>o 
stand for the price of the default- free zero-coupon bond with maturity 7} , we define 




( 5 ) 
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the Tj -forward measure through its Radon-Nikodym derivative with respect to 
P* as given by 

T- 

e“ /o'' '■«d« 

“ B(0, 7}) ■ 

We say that P/' is given by taking the bond price {B{t, Tj))t>o as numeraire. 
Similarly, but now taking the issuer’s share price {St)t>Q — without dividends — as 
numeraire, we obtain the share measure ; its Radon-Nikodym derivative with 
respect to P* is given by 



g-/o^hM-/c(w)]dwc 



dP* 



^0 



(7) 



In what follows, expectation with respect to P*, P^/ and P^*^^ will be denoted by E*, 
E^/ and E^*^\ respectively. 

Proposition 2 The discounted CoCo arbitrage-free price, on the set {^ < r}, equals 



Ttf — IE [^1{^>7’} I H- C^IE I ^ IE [ cyl{tj>ry} I (8) 

j: Tj>t 

— C 5 ^ ^ 

= KB(t, T)V^(t > T\ ^,) + J ' p<^>(r < T\ ^,) + T Tj)P^>(Tj > Tj\ (9) 

J K(u)du , 

c y. Tj>t 



Proof Due to the Condition (F), to receive at time r is equivalent to receive 
^St at time T. Therefore the payoff in (5) is equivalent to 






K 



j=i 









( 10 ) 



and thus expression (8) for the price follows by preconditioning, taking into account 
that Cr = As for (9), it suffices to notice that, in light of the abstract Bayes rule, 

for every Z e , P/' ) we have 



Eb'[Z| = 



E* 


■ dP/-' 
Z 

dP* 






■ dP/-' 


- 


E* 


dP* 





E* 



Ze“/o' 



e-/o'-«d»5(r, 7}) 



( 11 ) 
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and thus 




( 12 ) 



Similarly, for every Z e we have 



(13) 



Combining (8) with these identities we get the expression (9). 

With Proposition 2 at hand, the subsequent difference between models relies on 
how conversion and coupon cancellation is defined, and how the corresponding prices 
are evaluated. In this paper we follow a structural approach to price CoCos. That is 
to say, given a model for the share price (5'^)^>o, a series of critical time- varying 
barriers I and V are set in such a way that f ^ > • • • > f and the credit events 
are given by 

T := inf{t > 0 : St < and zj := inf{t > 0 : St < iHt)], j = 1, m, (14) 

with the standard convention inf 0 := oo. Then our main concern is to derive analyt- 
ically tractable formulas for (8)-(9), either in the form of closed formulas or efficient 
simulation methods. Later on in Sects. 5 and 6 we shall incorporate short-term uncer- 
tainty and extension risk to the pricing problem. 



2.2 Pricing CoCos with Write-Down 

In the case of CoCos with write-down, upon the appearance of the trigger event the 
investor does not receive a certain amount of shares. Instead, she receives only a 
fraction R e (0, 1) of the original face value K, provided that the issuer has not 
defaulted. Let 8 denote the random time at which the issuer may default. Then the 
payoff of this Co Co contract with write-down equals 



Similarly to Proposition 2, we can give now a model-free price formula for the 
Co Co with write-down. For this matter, we make no further assumption beyond 
model consistency in the sense that 8 and r are modelled in such a way that 8 > t 
so that default may only occur after conversion. 
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Proposition 3 The discounted arbitrage-free price of the CoCo with write-down, 
on the set {^ < r}, equals 

r)P^(T > T\ ^t) + RKB(t, T)F^(S > T\ ^t) 

m 

+ Y, CjB{t. Tj)F'^>{xj > Tj\ (15) 

j: Tj>t 



Proof It suffices to notice that the payoff can be rewritten as 



^!{(5>5'}l{r<5'} H” 1 {t>5} = H” (^1{t>5} + (1 “ ^)1{t>5}) 

= ^!{(5>5'}1{t< 5'} H” ^l{r>5'}l{(5>5'} H“ (1 “ ^)1 {t>5'} 

= ^1{(5>5'} (l{r<5} H” l{r>5'}) + (1 “ ^)1 {t>5'}9 

where for the second equivalence we have used the identity l{r> 5 }l{ 5 > 5 } = ^{r>s}, 
which holds due to the consitency assumption z < 8. 

By comparing (9) with (15) we can see that the techniques used to price the CoCo 
with conversion can be readily applied to CoCo with write-down. Across this work 
we focus on the former contract. 



3 A Model with Stochastic Interest Rates 

In this section we assume that the price of default-free zero-coupon bonds are sto- 
chastic. More specifically, forj g {1, ..., m}, the default-free zero-coupon bond price 
(B(t, Tj))te[0,Tj] is assumed to have the following P*-dynamics 

dB{t, Tj) u 

— ^ = r,dt + Y hit, Tj)dW^, (16) 

’ k=l 



where each bk is a positive deterministic cadlag function, and {W } , . . . , Wf)te[0j] is a 
J-dimensional Brownian motion with respect to the risk-neutral probability measure 
P* and the trader’s filtration F. We shall assume as well that the share price (St)te[0,T] 
obeys 

diS ^ 

^ = [r,~ icimt + Y^kim^l (17) 

^ k=l 

where a \= {cfk)i is a positive deterministic cadlag function such that, for all t e 
[0, 7}], the inequality ||cr(0 — b(t, Tj) | > 0 is satisfied. 
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The conversion and coupon cancellation events in our model are linked to the 
asset dividends and the evolution of bond prices, in such a way that conversion is 
triggered as soon as (5'^)^>o crosses 



it := r^)exp 




K(u)du , 



0<t<Tm; 



and similarly, forj g { 1 , ..., m}, the 7 -th coupon cancellation is triggered as soon as 
(St)t>o crosses 



Tj) exp I /c(t/)di/| , 0 < r < 7} 
M7, r = Tj. 



(18) 



The parameters Mj and Lj are assumed to be given non-negative constants satisfying 
Mj > Lj, with Lyn < Cp and 



^;+i 



< exp 




k{u)&u , 



7 = l,...,m — 1 , 



(19) 



so that the required ordering >£?>•••> is fulfilled — thus ensuring that 
0 < TJ < Tj implies Xj < 17 + 1 , for 7 = 1, ..., m — 1. Clearly the Mertonian condition 
(i.e., Stj must be bigger than Mj) can be removed by taking Lj = Mj. See Fig. 1 for 
an illustration of the barrier’s shape and parameters. 




Fig. 1 The graph illustrates the share price {St)t>Q, along with the barriers d and its parameters 
Lj and Mj. The first barrier is hit at f = T\, whereas the third one is hit at some T2 < t < T3. On 
the other hand, the second barrier is not hit since the share price stays above on the whole period 
[To, T2], and the Mertonian condition is satisfied, Le., St2 > M2. Conversion is not triggered either 
since the barrier I is never hit by (St)t>o 
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In the current setting, the process \= log plays a fundamental role. 

Indeed, from the definition of the random times r and tj (see (14)) and the barriers 
I and & it follows that 



{r > Tm) = 



inf Uf > 0 

^<t<Tm 



and {Xj > Tj) 




j = 1, 



From this observation we have that, in order to price the CoCo contract, we need to 
be able to compute the conditional joint distribution of f^} )’ where we have 

defined := info<^<r^ To this matter, notice that an application of the Ito 
formula tells us that 



du{ = -^ \a{t) - bit, Tj)fdt + \\a(t) - bit, 7})|| dwj\ 

where (wJ^^)t>o is the -Brownian motion given by the Girsanov theorem, corre- 
sponding to the probability change (6). In fact we can see that under we have 
similar dynamics 



dui = 1 Wait) - bit, 7})f dr + Wait) - bit, 7})|| 

where (W®)f>o is the -Brownian motion corresponding now to the probability 
change (7). Consequently, a time change given by 

ajit) ;= [ ||cr(s) - bis, Tj) f d^, 0 < r < 7}-, (20) 

Jo 

renders the fundamental process (f/^ := log -^h)t>0 a drifted Brownian motion. 

(t) 

Then we can apply a known result (see [34]) on joint distribution of the Brownian 
motion and its running infimum in order to obtain the following closed-formula for 
the CoCo price. See details in [13]. 

Proposition 4 In the current setting, the CoCo arbitrage -free price, on the set [t < 
Xm], is given by 



. ^ K{u)du 

= Z Bit, Tj)0i-d'+ - Dj) - ^ 0id^_ - Dj) ) (21) 



j: Tj>t 

-h KBit, Tffi) -|- I Cr — 



(Cr - [imBit, T^)0id’l) + 5 , 6 -/-"''" , 
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where 




, and 



1 




^J fp \\a(u) - b(u, Tj)f du 



( 22 ) 



and 0 denotes the standard Gaussian cumulative distribution function. 



3.1 The Black-Scholes Model and the Greeks 

The Black-Scholes model is obtained as a particular case of the model above, by 
taking default-free bond prices with null volatilities — i.e., by taking the Z?(-, 7}) in 
(16) to be zero. Consequently, the closed-form price formula given by 
Proposition 4 can be used in order to derive the Greeks, Delta A and Vega v, which 
respectively describe the CoCo'^ price sensitivity to share price and volatility. We 
have the following. 

Proposition 5 Let 0 and 0 denote, respectively, the standard Gaussian cumula- 
tive distribution and density function. In the Black-Scholes model, the CoCo’s price 
sensitivity to share price A := ^ and to volatility v := ^ are respectively given by 





and 



V 




where 




log^ - (rip \(y^){Tj-t) 



j = 1, 



- 1 



(23) 
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Some remarks are in order. On the one hand, it is has been documented that by 
actively hedging the equity risk, investors can unintentionally force the conversion 
by making the share price deteriorate and eventually trigger the conversion. This 
situation is referred to as the death-spiral ejfect. Now, from the above expression for 
the Delta A it can be checked that it is strictly positive, and in fact one can observe 
that the Delta A increases sharply when the time to maturity T decreases. Thus one 
of the conclusions in [13] is that the coupon cancellation feature leads to a flatter 
behaviour of the Delta Z\, hence reducing the death-spiral risk. On the other hand, it 
can also be checked that the Vega v is strictly negative, this tell us that an increase in 
the volatility translates into a decrease in the prices. Such observation is clearly in 
line with the intuition that a higher volatility will increase the probability of crossing 
the barriers i and defining the conversion and coupon cancellation events. 



4 Advanced Models 

4.1 Incorporating the Heston Stochastic Volatility Model 

Let us start this section by remarking the fact that the arguments preceding the obtain- 
ment of Proposition 4 hold even if the share and default-free bond price volatilities 
{i.e., a and Z?(-, 7}) in (17) and (16)) are no longer deterministic. However, the time 
change aj in (20) would be now stochastic and, consequently, the time-changed 
fundamental process dynamics 



would no longer match those of a drifted Brownian motion. Thus one anticipates that 
the pricing problem, in the setting of stochastic volatility, will lead to closed-form 
formulas only in few cases, and require more advanced numerical tools otherwise. 

As a particular model, we shall assume that the volatilities are stochastic according 
to the work of [26]: we consider a new stochastic factor {Vt)t>o acting on both a and 
/?(-, Tj), in such a way that the dynamics in (17) and (16) are now replaced by 




(24) 




(25) 



and 




d 



( 26 ) 



192 



J.M. Corcuera and A. Valdivia 



respectively. The factor (Vr)r>o is given as the solution to the following SDE 

dV, = [a- pVt]dt + vVVtdZr, 

where Ck:, ^ and y are constants, with 2ck: to ensure the positivity of the solution, 

and (Zt)t>o is a one-dimensional -Brownian motion. Similar dynamics under the 
share measure P^*^^ are assumed to be satisfied by 

If we further assume the independence between the noises driving the prices and 
their volatilities, then we see that the P^^ -Brownian motion in (24) is independent of 
the (now stochastic) time change 

Ujit) =[ Vs ||cr(s) - b{s, Tj) f ds, 0<t < Tj. (27) 

Jo 

Thus, by a preconditioning argument, we obtain the following extension of Proposi- 
tion 4. 

Proposition 6 In the current setting, the CoCo arbitrage -free price, on the set [t < 
Tm], is given by 



^t= Y. Bit, 7})E^n^(-< - - 

j,Tj>t V 

KS, 






+ KB{t, TmW"'[^i-d’!J)\ ^t\ J7^ 

Lj^it ^ K{u)du 

+ CrStt-JJ"' + CrLmBit, Tm)¥J^\<P{d1)\ 



where 



d’^ 



log 






± 2 St IkW - bis, Tj] 



ds 






^Vs \\a(s) -b(s,Tj)\\ ds 



and 



Dj = 



1 

||or(s)-fo(s,7}-)||^ds 




(28) 



(29) 



From the pricing formula above we can see that the Co Co price is related to the 
price of binary options. Indeed, for instance, for a binary option with maturity Tj and 
strike Mj we have 
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E* 






{STj<Mj] 






■ B{t, Tj)E^^ |^l{S7^.<My) 



= B(t, TjWJ 






Moreover, according to our previous discussion, conditioned to := (t(Vs, 0 < 
^ < Tj) the random variable U^, is normally distributed, so that we also have 



IP^j 





= E^J[<P(d’++Dj)\^tl 



Let us now give an explicit computation of the probability above; this illustrates how 
the CoCo price can be computed. For this matter, we use the relationship between 
the characteristic and distribution functions (see for instance [38]) which allows us 
to write 






Mi 

<logT^ 



1 




1 

2n 



f 


















-d?, 



where 

Ut, y,; := E^j[exp[i^(u{.)}\^,] = E^j[exp[i^{u{.)]\U„ V,], (30) 

and the last equation holds by Markovianity. Hence the problem of computing 
the probability above translates into the problem of finding an expression for 
(p^j (t, u,v; ^). It follows from the Ito formula that 



d(p^- 






+ - 



1 



2..T/ 



2 du^ 



-VCTrp. -| 



1 



1 dcp^j 



2 dv^ 



-y^v - -~—va^ + ——(a - M =0, (31) 

2 du J dv 



with the boundary condition ^^^(7), u, v; §) = e^^“, and where cr^. = \\a—b{-, 7})|p. 
For an affine solution like 

u, v; §) = QMTj-t)+Bj{Tj-t)v+i^u^ ^ 32 ) 

the PDF in (31) is reduced to the Riccati equation 



dB, 



w - \y^Bj + PBj = - _ 1 



dAj „ 

-W = «'®/’ 



(33) 
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with Aj and Bj vanishing at ^ = Tj. As shown in [13], for a constant function cr^., 
such equation is explicitly solved by 




+ p exp {-A,(7} -t)}-\ 



exp(-M7}-r)} + §]^’ 






and 




log ((X(^) - /I) exp {-X(^)(7}- - f)} + xfpf ) 




where := ^ ~ V* denote the analytic extension of the 

real square root to C \ M_ . 

4.2 An Exponential Levy Model 

In this section we shall consider an exponential Levy model for the share price. As 
opposed to the previous sections, we shall now consider a numerical approach to 
pricing, based on exploiting the so-called Wiener-Hopf factorization of the driving 
Levy process (Xt)t>Q. This approach has been recently applied in order to price 
contracts with path-dependent payoffs as in [12, 31]; see more details below. 

4.2.1 First-Passage Times and Wiener-Hopf Factorization 

Let be a Levy process with characteristic triplet (/x, a, v), and denote its 

characteristic exponent by For details and proofs of the following arguments we 
refer to [1]. 

Recall that ife(X) is an exponential random variable with parameter A., independent 
of (Xt)t>o, then we have the following equality in distribution 



Xe(X) — / + 5 , 



where I and S are independent random variables, distributed as 




and Xe(x) '= sup Xu, 

0<u<e{X) 



respectively. Moreover, 

E [exp (zX^a)}] = E [exp {zXe(A))] E [exp (zXe(A)}] • 



(34) 
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We shall refer to (34) as the Wiener-Hopf factorization. In fact, in general it holds 
that 



Consequently, the knowledge of one of the factors in (34) allows us to establish the 
other one. A particular case of interest arises when {Xt)t>Q is a spectrally negative 
process since, in this case, it is known that the right factor in (34) is given by 

ft (^) := E [exp [zXe(x) }] = ’ (35) 

where fx is a constant, depending on A., defined as the solution to 

(36) 

Therefore, once we have computed fx explicitly, we obtain the following expres- 
sion for the left factor in (34): 



f,-W:=E[expfe„)] = ;^-^^. (37) 

This expression can be linked to the distribution function of by partial integration. 
Indeed we have 



rOQ rO 

fXiz) = / Xe-^^dl / ze^^P(X, > 

Jo J —oo 

poo poo 

= Xz / ^)dfd^ (38) 

Jo Jo 

= XzF(X,z) 

where we have defined F(t, §) := IP(X^ > — §), and denoted its Laplace transform 
by F. As argued by 133], by combining (37) and (38) we can recover F by the standard 
Fourier transform inversion. Further, the result can be numerically computed in an 
efficient way, provided the condition 

1 

lim - [e^ - 1 - ztxl{\jc\<i}] v(dx) = 0, (39) 

holds true. This condition is imposed in order to facilitate the computation of fx (i-e . , 
the solution of (35)) by means of suitable integration contour change. In fact, such a 
change allows to take fix = This result is summarized in the following lemma. 
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Lemma 7 For fixed t and and given the parameter set (Ai, A 2 , h,h, N\,N 2 ), 
define 



a\ := 



2th' 






g{x) := fx{x/ix) 



and, for every g N, 






h\h2 

47V ^ 



ly ly 1 

n=-N k=-N 



F{g{ai + in/zi), «2 + 



If the condition (39) is satisfied, then the following approximation holds true 

^ /Ni\ 

nXt>^) = Y.(^)sN,+nit,^), 



where the symbol = indicates an Euler summation. 

The double sum in the lemma is used as an initial approximation of F. Here the 
parameters (Ai , A 2 , /i , h) are positive real numbers chosen large enough in order to 
control the aliasing error. The final Euler summation is used in order to improve the 
accuracy of the raw approximation sj^. It is suggested that choosing Ai = A 2 = 22, 
/i = /2 = 1, = 12 and A ^2 = 15 gives satisfactory results. For further details see 

[33] and references therein. 



4.2.2 The One-Sided CGMY Levy Process 

Hereafter we shall focus on a particular spectrally negative Levy process (Xt)t>t) 
known as one-sided CGMY process, or simply CMY process. This process has no 
continuous part, and only one-sided jumps with its Levy measure being given by 

v(dx) = Cexp{— Mx}|x|“^“^l{;c< 0 }dv, (40) 

where C, M > 0 and T < 1 are constants. Further, its characteristic exponent can 
be obtained in closed-form as 

nO 

i^xiz) = tiz+ [e^ - 1 - zxl[\x\<i]] v(dx) 

7 — 00 

= IXZ + Cr(-Y)((M + zf - M^), 



( 41 ) 
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where F stands for the Gamma function. Thus it is apparent that the condition (39) 
holds true. Before we proceed, let us mention that the CGMY processes are also 
referred to as Tempered stable processes. On the other hand, by setting the parameter 
7 = 0 (resp., 7 = 1/2) the CMY process becomes a Gamma process (resp.. Inverse 
Gaussian process). For details on CMY processes we refer to [4, 31] and [36, Section 
2.3.5]. 

In order to price CoCos we need to understand the behavior of (Xt)t>o both under 
the risk-neutral measure P* and the share measure P^*^^ . The following result shows 
how the Levy characteristics of (Xt)t>o change under Esscher transforms. 

Lemma 8 For every real number 0, consider the probability measure P“, equivalent 
to P, given by 

dP“ exp [ciXt} 

"d^ “ E [exp {aXr]] ' 

Assume Ma := M — a > 0. Then the Levy exponent of{Xt)t>o under P“ is given by 

/*0 

lAx(z) := Zfla + [e^ -I- Zx1[\x\<d] Va{dx), 

7—00 



where 

/ x{e^^ — l)v{dx), and %(dv) := C exp{— Mojx}|x|~^~^l|;c<0|dv. 

^W<1 

Proof The first part follows from (41) and [35, Theorems 33.1 and 33.2]. Now, using 
the expression in (40) we have 

Vocidx) = Cexp{-(M - ck:)x}|xr^“^l{;c< 0 }d^ = Cexp{-Mc,x}|xr^“^l{;c< 0 }d^- 

The assumption > 0 assures that (C, 7) is a rightful parameter set for a 

CMY distribution. 



4.2.3 Application to CoCos 



We shall assume that, under P*, there is a pure-jump (C, M, 7)-Levy process (Y^)^>o 
driving the share price (5'^)^>o in such a way that 



E*[exp{Z,)] 



exp{/xr + Y^}, 



t > 0, 



where the interest rate r and the dividends k are assumed to be constants, and we 
define m := — log E* [e^^ ] and set /x := r — K-\-m. Further, we shall assume that the 
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parameter M is bigger than 1 . We remark here that, on the one hand, this technical 
assumption will allow us to accommodate the Variance Gamma (VG) process con- 
sidered in [30]. On the other hand, this assumption is consistent with the numerical 
experiments reported by [31]. 

Proposition 9 In the current setting, the price of a CoCo at time D <t <T can be 
numerically approximated in an efficient way by means of the expression 

m 

Tj) + (f, r„), 

j: Tj>t 



where 



N2 /i\j \ 

") 4 ,+« f - t, log 4 + ruijj - 0) , 7 = 1, m, a = 0, 1, 

n=0 ^ ' 



(42) 



with the symbol = indicating an Euler summation, and 



^01 Q §) •— ^ ^ ffo(_+YCr( Y) {Ma + fx^^t) 

n=-N k=-N 



4fiat^hh 
Fa(ga(ai + inhi), a 2 + 

with the parameters (ai , U 2 , hi, h 2 , h, h, hfi, N 2 ) given as in Lemma 7, and 

X - ZHa 



Fa(X,z) := 



(A - zi^a - Cr(-Y)({M„ + zV - Ml)) zX ’ 



gaix) 



:= X + Cn-Y) + ii^xf - mI^ , 



a =0, 1, 



where and pi^ defined in Lemma 8. 

Proof Taking into account the general expression for the CoCo price {cf. Proposition 
2), computing boils down to compute 



Ppt, Tj) := P* {tj > 7}| #•,) = P*(Xr._, > -^) 



^=\ogiSt/&{t))+m(Tj-t) 



for j = 1, ..., m, and 

Pl(t, T^) ;= P(^l (r„ > T^\ > -?) 



l=\0g{St/l^{t)) + Uj{Tm-t) 



These computations can be carried out by means of Lemma 7. Indeed, under this 
exponential Levy model, the share measure (resp., risk-neutral measure P*) 
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coincides with an Esscher transform of parameter a = 1 (resp., a = 0). In light 
of Lemma 8, the driving noise (Xt)t>o will remain a CMY process under but 
now having the shifted parameter set (C, Mi, 7) = (C, M — 1, 7). Moreover, this 
implies that the Levy measure of (Xt)t>o also satisfies the condition (39) under 
Thus, if we write = P* and P^ = by the reasoning in Sect. 4.2.1 we see that 
the Laplace transform of Fa (t,^) :=F^ (X^ > — §) is given by Fa as defined above. 
Moreover, the correspondent contour change is given by ga . 

Remark 10 [12] provides an alternative approach which exploits the Wiener-Hopf 
factorization in a different way: instead of computing first-time passage probabil- 
ities as done here, what is computed is the joint density of (X^, X^). As the noise 
driving share prices, the authors consider the so-called Beta-Variance Gamma (fi- 
VG) process — also referred to as Lamperti-Stahle process by [3] — which exhibits the 
same exponential decay as the Variance Gamma process, hence leading to a smile- 
conform model. Lor this y0-VG process the distribution of the variables Xe{X) and 
X^{x^^ can be specified, thus obtaining the Wiener-Hopf factors and . Taking 
(8) into account, combining the knowledge of the density with a Monte- 

Carlo technique due to [29], the authors provide an efficient numerical pricing of 
CoCos. 



5 Triggering Conversion Under Short-Term Uncertainty 

Linking credit events to the movements of 3. fully observable (/.^., F- adapted) process 
(Ut)t>3 is certainly one of the most appealing features of structural models. Indeed, 
this/w// observability assumption — hereafter referred to as (Al) — gives rise to clear 
and analytically tractable models as we have seen in the previous sections. When 
considering contingent capital contracts such as CoCos, however, the assumption 
(Al) seems arguable since in most cases regulatory capital depends on the balance 
sheets of the issuer, and those sheets are updated only at a series of predetermined 
dates (tj)teN- Thus we are interested in considering the following partial observability 
assumption. 

Assumption (AlO- The fundamental process {Ut)t>o is fully observable only at 
predetermined dates (tj)ten- 

On the other hand, when the process {Ut)t>o is related to the share price, it is 
also commonly assumed that the correlation between the noises driving the share 
price and (JJt)t>o is equal to p = 1 (or p = — 1) — hereafter this assumption will be 
referred to as (A2). Nevertheless, it would be reasonable to consider the chance that a 
different (possibly time-dependent) correlation parameter p g [—1, 1] may provide 
a better fit. Consequently we are also interested in considering the correlation p as 
an additional rightful model parameter by taking the following alternative to (A2). 

Assumption (A20- The correlation p between the noises driving the share price and 
(JJt)t>{) may vary. 
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In what follows we revisit the simple framework of Sect. 3.1 in order to illustrate 
these ideas and show how the pricing problem is modified under (Al^ and (A20- For 
a full study of this short-term uncertainty model we refer to the forthcoming paper 
[15]. 



5.1 Pricing CoCos on a Black-Scholes Model Under 
Short-term Uncertainty 



As shown in Sect. 3.1, in the Black-Scholes model, 



d5r = 5r([r - /c]dr -I- o-dW;), 



the cancellation of the 7 -th coupon is triggered as soon as the process 

S 1 

dUt := dlog — = --cr^dt + adW*, 

tf 2 

crosses the critical value log ^,7 = 1 , ..., m, whereas for conversion zero is the 
critical level. In this setting Assumption (A2') is translated as the correlation structure 
between the noise driving (St)t>o and that of the new process 

dUt(p) := -I- adwt := -f ad{pW* + - p^Z,). (43) 

where p is the given correlation parameter, and (Z^)^>o is a second Brownian motion, 
independent of Thus, instead of the process above, we shall now 

consider the parametric family {Ut{p))t>o whose driving noise (Wf)t>o is also a 
Brownian motion but correlated to (W*)t>o, in such a way that dW^ dW* = pdt. 
Further, the time at which the 7 -th coupon may be cancelled is given by 



Tj(p) := inf 



f > 0 : Ut(p) <\og-^ . 

H . 



As for Assumption (AlO, notice that the full information flow corresponds to 
:= cr(W^*, Z„ 0 < ^ < 0 = V t > 0, 



whereas, setting L^J := min{tj e { 0 , t\, t2 , ...} : tj < t < ^ 7 + 1 }, the information 
available to the modeller is now given by 

:= V cr(Z„ 0 < ^ < Uj) = ^ ^ 0- 
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Since % ^ and the equality holds only at the predetermined dates [tj, j e N}, 
we can think of (Z^)^>o as an extra source of noise which clears out at update times 
The fact that the extra noise is cleared out at motivates the notion of 

short-term uncertainty, and it has two important implications. On the one hand, our 
model differs from other partial or incomplete information models like [9, 10, 21], or 
[25] since the information structure is different. On the other hand, as opposed to other 
structural models, the short-term uncertainty considered here prevents the conversion 
time Tj{p) from being a stopping time with respect to the reference filtration F, which 
is generated by the relevant state variables and the risk-free market. Hence, one can 
investigate conditions under which r, (p) admits an intensity, as done by [9, 21], or 
[27]. 



5.2 Coupon Cancellation Probabilities Under Short-Term 
Uncertainty 



Let us show how the coupon cancellation probabilities are modified under the 
assumptions (Al') and (A2')- We begin by defining two auxiliary processes 

:= crVl - p'^(Zt-Zyi) and := p log -^ + p log t > 0. 

l\_t\ 

These processes have an important role in the computations within our short-term 
uncertainty model since they appear implicitly in {Ut{p))t>o according to the fac- 
torization 



+ (44) 

It is apparent that the term between parentheses belongs to v . On 

the other hand, is independent of and it is normally distributed with zero mean 
and variance 



We note here that the variance v^{t) represents a key quantity within this frame- 
work. Indeed, on the one hand, it actually encodes the two new features of our 
model: the factor 1 — measures how close (5'^)^>o and (f/?)r>o are to being com- 
pletely correlated; whereas the factor t — \j\ measures the elapsed time from the 
last information update. On the other hand, as the following result suggests, coupon 
cancellation probabilities and other analytical formulas obtained within our model 
depend explicitly on v{t). 
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Proposition 11 For every x G M define the random time Xx{p) := inf > 0 : Us 
(p) < x]. Then, for every ^ <t <T, the following equation holds true on {r > L^J } 



?*{zAp) > T\:^t) = 

E* 






where the expectations above are restricted to the values 



X-Uyt\{p)-^t±\cT^{T -t) 



D± = 



Cf 






Moreover, 
and 






(45) 



4>{z)Az (46) 



E* 






\D+ - 



<t>{z)Az, 

(47) 

where d> and 0 stand, respectively, for the standard Gaussian cumulative distribution 
and density functions. 

Proof Since under the computation is known, for every t = tj we have, on 
{Tx{p) > t], 



^fe(p) > ri^.) = P*fe(p) > rm 



( 



-X + Utjip) + \oHt - tj) 



- tj 

X - c/,.(p) + \a^{T - tj) 



WT - tj 



) 



) 






Define 
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For t ^ [tj, j G N}, by conditioning we get, on {Tx(p) > [t]}, 

¥*(tAp) > T\^t) = E* [P*fe(p) > = E* [0 (-c?i(r)) I 

- E* (dl(t)^ #r] . 

In these terms, for L^J < t, the first summand above satisfies 

E* [0 {-d-{t))\ #f] = E* 1^0 (-D- + 



0]- 



where the right-hand side of the expectation above is restricted to the value of D- 
given in (45); in fact the equation above reduces to (46) since ~ A^(0, v^{t)) for 
every fixed r > 0. Similarly for the second summand, it follows from (44) that 



r Q-(Ut(p)-x )0 



(d+ ^|==)j 

= ^-(u^,i^p)-x) ^ (d+ ^ 1=) j . 



In order to obtain (47), let us consider the change of measure given by 



dP 

dP* 



= exp 



1 2 

- -v^it) 



= exp 



[ ap - j](s)dZ, 

Jo 

-i [ {oxjl - pH[\,i,Js)fds 
^ Jo 

In virtue of the Girsanov theorem, the process 

:=Zs~ [ (Ta/i pH[itjj](u)du, s >0, 

Jo 

follows a P -Brownian motion. Thus 






D+- 






,)] 



-Ep. « 0+ 






a^T - 

fo o^\/l - + v'^it) 



a^T — t 
.2 



)] 



Jr \ a^T -t ) JtjtvJP) 

Jr \ a^T -t ) 



dx 
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where the first equivalence follows from the abstract Bayes’ rule, and for the last 
equivalence we have simply used the standard change of variables z = . 

Jeanblanc and Valchev [27] study the role of information on defaultable-bond 
prices within a Black-Scholes setting, with constant parameters and fiat default bar- 
rier. In some sense, the short-term uncertainty model considered here can be seen 
as a bivariate extension of [27]. Despite this analogy, immediate differences arise. 
For instance, the survival probabilities obtained by [27] are constant between obser- 
vation dates, i.e., within each interval [tj, Whereas in Proposition 11 these 
probabilities vary in continuous time. This difference relies on the fact that, event 
though within each interval [tj, our knowledge of the short-term noise is 
constant, we still fully observe the evolution of (St)t>o and all the other F-adapted 
state variables. 

In order to conclude this section, let us recall that in light of our discussion 
in Sect. 3.1, CoCo prices can be obtained in our current setting once we compute 
expressions of the form 



P*(T^(p) > Tj, Stj > Lj\,^t), and St„ > 

It is worth noticing that the conditional joint distribution of (zj(p),STj) = 

(inf 5 <j^. Us(p),STj) cannot be computed directly from Proposition 11 since the 
entries of this vector are driven by two different (though correlated) Brownian 
motions. An additional complication comes from the fact that the current infor- 
mation about one of them might be incomplete. The aforementioned distribution, 
and full details on the model, can be found in [15]. 



6 Extension Risk 



According to the new regulatory Basel III framework, CoCos can be categorised as 
either belonging to the Additional Tier 1 or Tier 2 capital category. In order to belong 
to the former class, a CoCo is supposed to have the coupon cancellation feature and, 
further, no fixed maturity is to be imposed to the contract. Instead, the issuer is entitled 
to redeem the CoCo at any of the prespecified call times {T/, / g N}. Moreover, as 
opposed to the common practice on callable contracts before the 2008 financial crisis, 
the definition of this contract does not contain any incentive {e.g., a coupon step-up) 
for the issuer to redeem at the first call date. Investing in such a contract has the 
inherent risk of a financial loss due to the lengthening of the (investor’s) expected 
maturity duration which ultimately postpones the payment of the face value K. This 
risk is referred to as extension risk. Two recent papers [14, 18] have addressed the 
problem of pricing CoCos belonging to the Additional Tier 1 capital category. As an 
illustration, let us revisit the Black-Scholes model in Sect. 3.1. 

In order to emphasize the correspondence with call times, we add now an extra 
index / g N to the coupon structure, in such a way that a coupon Cij will be paid 



Pricing CoCos with a Market Trigger 



205 



at Tij provided Tij > Tij. It will be assumed that for every i e N the ordering 
Ti-i < Til, ... < Tim •= Ti holds, where we set To := 0. For the sake of clarity, let 
us remark that in the current setting, the barriers in (18) and their parameters become 



:= 






Mii, 



t — Tij. 



From the issuer’s point of view, the question of whether to postpone or not the 
face value K payment depends on which alternative is cheaper. Hence, similarly to 
the situation of Bermuda options (see for instance [37]), the discounted price of a 
CoCo belonging to Additional Tier 1 capital category equals 





inf E* [ 


•^T„ 


, t = T„e {Ti, e N} 


(a) 


n,:= 


ee% L 






(48) 






t G (Tn, Tn-\-l), 


(b) 



where ^ stands for the set of stopping times taking values in {Ti, i > n] and 



l:Ti=e m ^ 

^ 0 ^ — ^ '^^^iA{rij>Tij,ST-->Mij} + Kq Srl{r<e} - 

i=n+lj=l 

It is important to remark that even though the general optimal stopping theory allows 
us to characterize the solution to the optimization problem in (48a), this is not enough 
to tackle whole pricing problem. Indeed, the solution to (48a) must be obtained in 
a relatively explicit way in order to be able to give a reasonable expression for the 
price in-between call dates (48b). Here we shall address the finite horizon case, i.e., 
the case where there are only finitely many call dates {Ti, ..., 7)v} and 7)v < oo. 

In this case, for every fixed Tn, the solution to the optimization problem in (7.6.1) 
is related to the lower Snell envelope {Y^ )ke{n,...,N] of the process )ke{n,...,N] 
given by 



:= 2 



= Kq + — e^^5rl{T<r^} + ^ Cijl{x->Tu,ST ;, • 



i=n+l j=l 



Having a finite horizon allows us to obtain (T^ )ke{n,...,N} by means of the following 
backwards procedure 



min 






k = N 
k = N - 



y(^) 



(49) 
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(yi\ 

As it turns out, the lower Snell envelope )ke{n,...,N] can be obtained in a rather 
explicit form. Indeed, for the first iteration of (49), if r > then what we have is 
the raw expression 



4-1 = {4"^ . E* [4' I } 

( K ~ ^ 

i=n+\j=\ 



E" 



■n+\ j= 

^ N m 

P i=n-\-l 7=1 

N—l m 
i=n+lj=l 






(50) 



+ l{r>r^_i}min 



Kq-^Pn-i ^ 



r K ^ ^ 1 

E* KQ-^'^^l{r:^TN} + ^tMxSTn} + 

L J 

K ~ ^ 



/=n-|-l 7=1 



where as in the previous section, 7tT^_-^ denotes the price of CoCo here with maturity 
7)v and coupon structure (c^j, T^j, PNj)JLi • to the share price Markovianity, the 
price 7tT^_-^ can be seen as function of St^_^ ; we denote this function simply by 
Now, as discussed in Sect. 3.1, the CoCo has a positive Delta, thus the 
function (x) is increasing and we can find a value such that 



S^_i := inf |x > 0 : 7Vn(x) > Ke . 



Hence we obtain 







i=n+l 7=1 

V-1 m 

K 



= Ke ‘l{i>T„_,,Srs,_i>SJ_,) + =e'"''Vl(r<r«_i) + E (51) 



i=n+l 7=1 



+ E* 









{tNj>TNj,STf^j >MNj,STj^_^ <S^_y} 



7=1 






So far we can see that basically all indicator functions appearing originally in (50), 
has been augmented in (51) by an additional condition on (i.e., 
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or > S^_ ^ ). In [ 14] it has been shown that the computation of the Snell envelop 

(Yj^ )ke{n,...,N} can be carried out by means of the above backward procedure. In fact, 
it can be seen that the payoff of a CoCo having the callability feature can be written 
in terms of 



CijhTiJ>Ty,ST^<sl...,ST■_^<s*_^,STy>MiJ] at times Ty, i > 1, 

Kl{r>Ti,STo<sl...,ST^_^<s*_^.ST^>s*} atr,-, i > 1, 

K 

— e 5'rl{rri<rA,,Sro<s*,...,Sr,i-i<sf^^_i) atT, 

where [r] is the element of {Ti, T)v} such that [r] — 1 < r < [r], and the 
additional variables 5'^ , . . . , S ^_2 are defined by analogous reasoning to that behind 
the obtainment of S^_^. Here Sq := oo and := 0 are set by convention. 

Proposition 12 If the CoCo with extension risk is active and conversion has not 
occurred, then its discounted arbitrage -free price is given by 

nt = ^ CyP* (^Ty > Tij, Sto < S^, Sti_i < > Mij, I 

ij: Tij>t 

+ KF*{T>Ti, St, < si..., St,_, <S*_„ St, > Sf \ #■,) (52) 

i: Ti>t 

Nr 

+ Z (t < Ti, St, < S^, ..., St,_, < SU,St, > S*\^t) ■ 

i=l 

Proof With the explicit description of the payoff corresponding to the CoCo with 
extension risk, the result is obtained as in Proposition 2, here taking into account the 
following identity 

I ["^1 ^ Stq < Sq, ..., 5'|-r]-i < 

= < Ti, St, < St,_, < S*_„St, > S*}. 

In view of this proposition, the obtainment of a closed-form formula for the price 
CoCo with extension risk requires the knowledge of the conditional distribution 
of (r, Stq, Sti, 5'r-) for i = 1, In the Black-Scholes model, this can be 

achieved by means of the following general lemma obtained in [14]. 

Lemma 13 Let a Brownian motion with drift /x and volatility a, and 

denote by x its first-passage time to level zero. Then, for arbitrary instants T\ < 

• Tn and arbitrary non-negative constants a\, an, on {x > t} the following 
equation holds true 
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P(r > Tn,Bj^ < ai, < an-i,Bj^ > a„| .^t) 

= P(-ai < St-j < ai, -a„_i < Br„_i < a„^i,BT„ > a„\^t) 

_g-2/icr ‘Sip(_aj < ai, -a„_i < Br„_i <«n-l,5r„ 

where Bjj = Btj — 2/x(7) — t), j = 1, /t, (^?)?>o stands for the natural 

filtration generated by (Bt)t>{). 
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Quantification of Model Risk in Quadratic 
Hedging in Finance 



Catherine Daveloose, Asma Khedher and Michele Vanmaele 



Abstract In this paper the effect of the choice of the model on partial hedging in 
incomplete markets in finance is estimated. In fact we compare the quadratic hedging 
strategies in a martingale setting for a claim when two models for the underlying 
stock price are considered. The first model is a geometric Levy process in which 
the small jumps might have infinite activity. The second model is a geometric Levy 
process where the small jumps are replaced by a Brownian motion which is appropri- 
ately scaled. The hedging strategies are related to solutions of backward stochastic 
differential equations with jumps which are driven by a Brownian motion and a 
Poisson random measure. We use this relation to prove that the strategies are robust 
towards the choice of the model for the market prices and to estimate the model risk. 

Keywords Levy models • Quadratic hedging • Model risk • Robustness • BSDEJs 
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1 Introduction 



When jumps are present in the stock price model, the market is in general incomplete 
and there is no self-financing hedging strategy which allows to attain the contingent 
claim at maturity. In other words, one cannot eliminate the risk completely. However 
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it is possible to find ‘partial’ hedging strategies which minimise some risk. One way 
to determine these ‘partial’ hedging strategies is to introduce a subjective criterion 
according to which strategies are optimised. 

In the present paper, we consider two types of quadratic hedging strategies. The 
first, called risk-minimising (RM) strategy, is replicating the option’s payoff, but it is 
not self-financing (see, e.g., [19]). In such strategies, the hedging is considered under 
a risk-neutral measure or equivalent martingale measure. The aim is to minimise 
the risk process, which is induced by the fact that the strategy is not self-financing, 
under this measure. In the second approach, called mean-variance hedging (MVH), 
the strategy is self-financing and the quadratic hedging error at maturity is minimised 
in mean square sense (see, e.g., [19]). Again a risk-neutral setting is assumed. 

The aim in this paper is to investigate whether these quadratic hedging strategies 
(RM and MVH) in incomplete markets are robust to the variation of the model. Thus 
we consider two geometric Levy processes to model the asset price dynamics. The 
first model (St)te[0,T] is driven by a Levy process in which the small jumps might 
have infinite activity. The second model (Sf)te[0,T] is driven by a Levy process in 
which we replace the jumps with absolute size smaller than e > 0 by an appropriately 
scaled Brownian motion. The latter model (5'f)^e[0,r] converges to the first one in 
an L^-sense when s goes to 0. The aim is to study whether similar convergence 
properties hold for the corresponding quadratic hedging strategies. 

Geometric Levy processes describe well realistic asset price dynamics and are 
well established in the literature (see e.g., [5]). Moreover, the idea of shifting from a 
model with small jumps to another where these variations are represented by some 
appropriately scaled continuous component goes back to [2]. This idea is interesting 
from a simulation point of view. Indeed, the process (5f )^e[0,r] contains a compound 
Poisson process and a scaled Brownian motion which are both easy to simulate. 
Whereas it is not easy to simulate the infinite activity of the small jumps in the 
process (St)te[0,T] (see [5] for more about simulation of Levy processes). 

The interest of this paper is the model risk. In other words, from a modelling 
point of view, we may think of two financial agents who want to price and hedge an 
option. One is considering (St)te[0,T] as a model for the price process and the other 
is considering (5'f)^e[0,r]- Thus the first agent chooses to consider infinitely small 
variations in a discontinuous way, i.e. in the form of infinitely small jumps of an 
infinite activity Levy process. The second agent observes the small variations in a 
continuous way, i.e. coming from a Brownian motion. Hence the difference between 
both market models determines a type of model risk and the question is whether the 
pricing and hedging formulas corresponding to (5’f)^e[0,r] converge to the pricing 
and hedging formulas corresponding to (St)te[0,T] when s goes to zero. This is what 
we intend in the sequel by robustness or stability study of the model. 

In this paper we focus mainly on the RM strategies. These strategies are considered 
under a martingale measure which is equivalent to the historical measure. Equivalent 
martingale measures are characterised by the fact that the discounted asset price 
processes are martingales under these measures. The problem we are facing is that 
the martingale measure is dependent on the choice of the model. Therefore it is clear 
that, in this paper, there will be different equivalent martingale measures for the two 
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considered price models. Here we emphasise that for the robustness study, we come 
back to the common underlying physical measure. 

Besides, since the market is incomplete, we will also have to identify which 
equivalent martingale measure, or measure change, to apply. In particular, we discuss 
some specific martingale measures which are commonly used in finance and in 
electricity markets: the Esscher transform, the minimal entropy martingale measure, 
and the minimal martingale measure. We prove some common properties for the 
mentioned martingale measures in the exponential Levy setting in addition to those 
shown in [4, 6]. 

To perform the described stability study, we follow the approach in [8] and we 
relate the RM hedging strategies to backward stochastic differential equations with 
jumps (BSDEJs). See e.g. [7, 9] for an overview about BSDEs and their applications 
in hedging and in nonlinear pricing theory for incomplete markets. 

Under some conditions on the parameters of the stock price process and of the 
martingale measure, we investigate the robustness to the choice of the model of the 
value of the portfolio, the amount of wealth, the cost and gain process in a RM 
strategy. The amount of wealth and the gain process in a MVH strategy coincide 
with those in the RM strategy and hence the convergence results will immediately 
follow. When we assume a fixed initial portfolio value to set up a MVH strategy we 
derive a convergence rate for the loss at maturity. 

The BSDEJ approach does not provide a robustness result for the optimal number 
of risky assets in a RM strategy as well as in a MVH strategy. In [6] convergence rates 
for those optimal numbers and other quantities, such as the delta and the amount of 
wealth, are computed using Eourier transform techniques. 

The paper is organised as follows: in Sect. 2 we introduce the notations, define 
the two martingale models for the stock price, and derive the corresponding BSDEJs 
for the value of the discounted RM hedging portfolio. In Sect. 3 we study the stabil- 
ity of the quadratic hedging strategies towards the choice of the model and obtain 
convergence rates. In Sect. 4 we conclude. 



2 Quadratic Hedging Strategies in a Martingale Setting for 
Two Geometric Levy Stock Price Models 

Assume a finite time horizon T > 0. The first considered stock price process is 
determined by the process L = (Lt)te[0,T] which denotes a Levy process in the 
filtered complete probability space (^2, F, P) satisfying the usual hypotheses as 
defined in [18]. We work with the cadlag version of the given Levy process. The 
characteristic triplet of the Levy process L is denoted by (a, We consider a 

stock price modelled by a geometric Levy process, i.e. the stock price is given by 
Sf = 5'oe^^ Wt G [0, T], where Sq > 0. Let r > 0 be the risk-free instantaneous 
interest rate. The value of the corresponding riskless asset equals e'"^ for any time 
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t G [0, T]. We denote the discounted stock price process by S. Hence at any time 
t G [0, T] it equals 

% = 

It holds that 

dS, = Stddt + StbdW, + S, [ - l)N{dt, dz), (1) 

JM.Q 

where W is a standard Brownian motion independent of the compensated jump 
measure N and 



a=a-r + + [ (e^ - 1 - zl{|z|<i)) ^{dz). 

^ Jm.q 

It is assumed that S is not deterministic and arbitrage opportunities are excluded (cfr. 
[21]). The aim of this paper is to study the stability of quadratic hedging strategies in 
a martingale setting towards the choice of the model. Since the equivalent martingale 
measure is determined by the market model, we also have to take into account the 
robustness of the risk-neutral measures. Therefore we consider the case where P is not 
a risk-neutral measure, or in other words 5 / 0 so that S is not a P-martingale. Then, 
a change of measure, specifically determined by the market model (1), will have to be 
performed to obtain a martingale setting. us denote a martingale measure which 
is equivalent to the historical measure P by P. We consider martingale measures that 
belong to the class of structure preserving martingale measures, see [14]. In this case, 
the Levy triplet of the driving process L under P is denoted by Theorem 

III. 3. 24 in [14] states conditions which are equivalent to the existence of a parameter 
0 G M and a function p(z; 0), z G M, such that 

[ |z(p(z; 0) - l)|f(dz) < 00, (2) 



and such that 

a=a+b^0+ [ z {p{z\ 0) - 1) l{dz) and l{dz) = p{z\ 0)€(dz). (3) 
J(|Z|<1) 

For 5 to be a martingale under P, the parameter 0 should guarantee the following 
equation 

ao = a - r + + [ (e^ - 1 - zl{|z|<i)) ^(dz) = 0. (4) 

^ Mq 

From now on we denote the solution of Eq. (4) — when it exists — by 0 q and the 
equivalentjnartingale measure by P@q . Notice that we obtain different martingale 
measures P@q for different choices of the function p(. ; 0o). In the next section we 



Quantification of Model Risk in Quadratic Hedging in Finance 



215 



present some known martingale measures for specific functions p (. ; 0q) and specific 
parameters which solve (4). _ 

The relation between the original measure P and the martingale measure IP^q is 
given by 



dlPep 

dP 



= exp (bOoWt - + [ [log (p(z; 0o)) A^(d^, dz) 

\ ^ Jo JRo 

+ 1 [ (log (p(z; 0o)) + 1 - p(z; &o)) €(dz)i . 

JM.Q / 



From the Girsanov theorem (see e.g. Theorem 1.33 in [17]) we know that the 
processes and defined by 

= dWt - b0odt, (5) 

N^^idt, dz) = N(dt, dz) - p(z; &oX(dz)dt = N(dt, dz) + (1 - p(z; 0o)) i(dz)dt, 



for Sill t G [0, T] and_z G Rq, are a standard Brownian motion and a compensated 
jump measure under 1 P@q. Moreover we can rewrite (1) as 

d5f = SMwf° + St [ (e^ - l)yV®o(df, dz). (6) 

J'M.Q 

We consider an -measurable and square integrable random variable Hr which 
denotes the payoff of a contract. The discounted payoff equals Hj = Ht. In 
case the discounted stock price process is a martingale, both, the mean-variance 
hedging (MVH) and the risk-minimising strategy (RM) are related to the Galtchouk- 
Kunita-Watanabe (GKW) decomposition, see [11]. In the following we recall the 
GKW-decomposition of the ^r-n^asurable and square integrable random variable 
Ht under the martingale measure IP^q 

Ht = (7) 

Jo 

where denotes the expectation under IP^q , is sl predictable process for which 

we can determine the stochastic integral with respect to S, and ^ square 

integrable -martingale with = 0, such that P@^ -orthogonal to S. 

The quadratic hedging strategies are determined by the process It indicates 
the number of discounted risky assets to hold in the portfolio. The amount invested in 
the riskless asset is different in both strategies and is determined by the self-financing 
property for the MVH strategy and by the replicating condition for the RM strategy. 
See [19] for more details. 
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We define the process 

y0a ^ £00 I ^ Vr e [0, r] , 

which equals the value of the discounted portfolio for the RM strategy. The GKW- 
decomposition (7) implies that 

V? e [0, T]. (8) 

Jo 

Moreover since ^ IP@q - martingale, there exist processes and 

such that 

= / xf^dWf^ + [ [ Yf\z)N^\ds,dz), Vr G [0, T], (9) 

Jo Jo Jrq 

and which by the IP@q - orthogonality of ^nd S satisfy 

X&ob + f 0o)f(dz) = 0. (10) 

7 Mo 



By substituting (6) and (9) in (8), we retrieve 

di>®« = dl¥®° + - 1) + Af®«(df, dz). 

Let 71 indicate the amount of wealth invested in the discounted risky asset 

in a quadratic hedging strategy. We conclude that the following BSDEJ holds for the 
RM strategy 

dvf° = + / Bf\z)N‘^°(dt, dz), 

7ro (11) 

where 

A®o = and B^^{z) = - 1) + Y^°{z). (12) 

Since the random variable Hr is square integrable and -measurable, we know by 
[20] that the BSDEJ (11) has a unique solution B^^). This follows from 

the fact that the drift parameter of V equals zero under IP^q and thus it is Lipschitz 
continuous. 
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We introduce another Levy process L^, for 0 < £ < 1, which is obtained by 
truncating the jumps of L with absolute size smaller than s and replacing them by 
an independent Brownian motion which is appropriately scaled. The second stock 
price process is denoted by = 5oe^ and the corresponding discounted stock price 
process is thus given by 

d^f = + S^,b&Wt + / (e^ - dz) + SfG(s)dWt, (13) 

J{\z\>s} 



for all r G [0, T] and Sq = Sq. Herein W is a standard Brownian motion independent 
of W, 

G^(s) = I (e" - and (14) 

ae = a - r + lr (b^ + G^(e)) + [ (e^ - 1 - zl{|z|<i)) i(dz). 



From now on, we assume that the filtration F is enlarged with the information of 
the Brownian motion W and we denote the new filtration by F. Moreover, we also 
assume absence of arbitrage in this second model. It is clear that the process U has 
the Levy characteristic triplet [a,b^ -\- G^(e), under the measure F. 

Let Fg represent a structure preserving martingale measure for . The charac- 
teristic triplet of the driving process U w.r.t. this martingale measure is denoted 
by (bs, b^ + G^(s), From [14, Theorem III.3.24] we know that there exist a 
parameter 0 eR and a function p(z; 0), z G M, under certain conditions, such that 

[ \z(p(z;0)-l)\l(dz) <oo, (15) 

-'{^<kl<i} 

ae=a + (b^ + G^Ce)) 0 + [ z (p(z; 0) - 1) t(dz), and (16) 

^ ^ J{s<\z\<l} 

lg(dz) = l{|z|>.}P(z; 0)l(dz). (17) 



Let us assume that 0 solves the following equation 

ae-r + l(b^ + g2(£)) + [ (e^ - 1 - zl(|,|<i)) Udz) = 0, (18) 

^ ^ ^ Jrq 

then is a martingale under F. From now on we indicate the solution of (18) — when 

it exists — as 0g and the martingale measure as F©^ . 
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The relation between the original measure P and the martingale measure is 
given by 



^1 _ = exp (^bOeWt - + G{s)&eWt - 

+ [ [ logipiz-, ©s))N{ds,dz) 

Jo J{\z\>s] 

+ t f (log(p(z; &e)) + 1 - P(z: 0£))£(dz)) . 

J{\z\>e] / 

The processes VK®®, W®®, and A^®* defined by 

dwf ' = dWt - b©edt, 

= dWt - G{s)©edt, 

7V®'(df, dz) = N{dt, dz) - p(z; &e)i{dz)dt 

= N{dt, dz) + (1 - P(z; &e))l(dz)dt, (19) 

for all r G [0, r] and z G {z G M : |z|^> £}, are two standard Brownian motions and 
a compensated jump measure under P©^ (see e.g. Theorem 1.33 in [17]). Hence the 
process is given by 



dSf = S^bdwf'^ + (e^-l)A^®^(d^,dz) + 5fG(e)dwf^ (20) 

J{\z\>s] 

We consider an -measurable and square integrable random variable Hj which 
is the payoff of a contract. The discounted payoff is denoted by The 

GKW-decomposition of under the martingale measure P@^ equals 

_ rT 

= E®«[77^] + / ^f«dAf + (21) 

Jo 

where is the expectation under P©^, is a predictable process for which 
we can determine the stochastic integral with respect to and is a square 
integrable P©^ -martingale with = 0, such that is P©^ -orthogonal to . 
The value of the discounted portfolio for the RM strategy is defined by 

y&e ^ G [0, T]. 

From the GKW-decomposition (21) we have 

= y®* + [ vt e [0, T], 

Jo 



( 22 ) 
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Moreover since is a -martingale, there exist processes and 

such that 

= I xf^dWf^-\- [ [ Yf^(z)N®^(ds,dz) + [ zf^dwf^ g [0, T]. 

Jo Jo J{\z\>s} Jo 

The P( 9 ^ -orthogonality of and implies that 

+ [ F®^(z)(e" - l)p(z; 0,)£(dz) -h Z^^G(s) = 0. (24) 

J{\z\>s} 

Combining (20) and (23) in (22), we get 



dV. 



0. 



: -h xf^) dwf^ -h [ - 1) -h T®^(z)) N^^idt, dz) 

^ ^ J{\z\>s} ^ ^ 

-h G(£) -h dwf^. 



Let jr denote the amount of wealth invested in the discounted risky asset 

in the quadratic hedging strategy. We conclude that the following BSDEJ holds for 
the RM strategy 

dv = Af Mirf ' + / (df , dz) + cf^dwf ‘ , 

J[\z\>s} (25) 



where 



A&e = fj.0e]y + B®*(z) = 7r®"(e" - 1) + F®«(z), and (26) 

C®^ =7r®^G(e) + Z®^ 



Since the random variable is square integrable and -measurable we know 
by [20] that the BSDEJ (25) has a unique solution This 

results from the fact that the drift parameter of equals zero under and thus 
is Lipschitz continuous. 



3 Robustness of the Quadratic Hedging Strategies 

The aim of this section is to study the stability of the quadratic hedging strategies 
to the variation of the model, where we consider the two stock price models defined 
in (1) and (13). We study the stability of the RM strategy extensively and at the end 
of this section we come back to the MVH strategy. Since we work in the martingale 
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setting, we first present some specific martingale measures which are commonly 
used in finance and in electricity markets. Then we discuss some common properties 
which are fulfilled by these measures. This is the topic of the next subsection. 



3.1 Robustness of the Martingale Measures 

Recall from the previous section that the martingale measures P( 9 q and are deter- 
mined via the functions p(. ; 0 o), p(. ; Os) and the parameters 6 ) 0 , 6 )^, respectively. 
We present the following assumptions on these characteristics. 

Assumptions 1 For Oq, 0^, p(. ; Oq), and p(. ; Gg) satisfying Eqs.(2)-(4), and 
Eqs. (15)-(18) we assume the following, where C denotes a positive constant and 

0 G {00, 0g}. 

(i) 00 and 0 ^ exist and are unique. 

(ii) It holds that _ 

100-0.1 <CG2(6), 

where G(£) = max(G(£), cr(s)). Herein a (e) equals the standard deviation of 
the jumps of L with size smaller than s, i.e. 

(T^{e)= [ z^Kdz). 

Ji\z\<s] 

(iii) On the other hand, 0^ is uniformly bounded in s, i.e. 



|6).| <C. 



(iv) For all z in ||z| < 1} it holds that 

Ipfc 6))| <C. 



(v) We have 



L 



{kl>D 



p\z; &)mz) < C. 



(vi) It is guaranteed that 



l(dz) < C. 



[ {l-p{z;0)f 

JM.Q 
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(vii) It holds for k e {2, 4) that 




z; 0o) 



p(z; &e)fmz) < CG'^Hs). 



Widely used martingale measures in the exponential Levy setting are the Esscher 
transform (ET), minimal entropy martingale measure (MEMM), and minimal mar- 
tingale measure (MMM), which are specified as follows. 

• In order to define the ET we assume that 



i 



Q^^i(dz) < 00 , yOe 



{kl>i} 



(27) 



The Levy measures under the ET are given in (3) and (17) where p(z; 0) = e®^. 
The ET for the first model is then determined by the parameter satisfying (4). 
Eor the second model the ET corresponds to the solution 0^ of (18). See [13] for 
more details. 

• Let us impose that 

[ <00, V6> G M, (28) 

J{\z\>l} 

and that p(z; 0) = in the Levy measures. Then the solution 0q of 

Eq. (4) determines the MEMM for the first model, and 0^ being the solution of 
(18) characterises the MEMM for the second model. The MEMM is studied in 
[ 12 ]. 

• Let us consider the assumption 




e"^^i(dz) < 00 . 



(29) 



The MMM implies that p(z; 0) = 0(e^ — 1) — 1 in the Levy measures and the 
parameters 0q and 0g are the solutions of (4) and (18). More information about 
the MMM can be found in [1, 10]. 

In [4, 6] it was shown that the ET, the MEMM, and the MMM fulfill statements (i), 
(ii), (iii), and (iv) of Assumptions 1 in the exponential Levy setting. The following 
proposition shows that items (v), (vi), and (vii) of Assumptions 1 also hold for these 
martingale measures. 

Proposition 1 The Levy measures given in (3) and (17) and corresponding to the 
ET, MEMM, and MMM, satisfy (v), (vi), and (vii) of Assumptions 1. 

Proof Recall that the Levy measure satisfies the following integrability conditions 
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/ z^l(dz) < 00 and / i(dz) < oo. (30) 

We show that the statement holds for the considered martingale measures. 

• Under the ET it holds for 0 G {0q, 0s) that 

p^(z', 0) = 

because of (hi) in Assumptions 1. By the mean value theorem (MVT), there exists 
a number 0' between 0 and 0 such that 

(1 - p(z; 0)f = C, 

where we used again Assumptions 1 (hi). For k g {2, 4}, we derive via the MVT 
that 

(p(z; 0o) - p{z; &e)f = (l - - 0.)^ 

where 0^' is a number between 0 and 0^ — 0o- Assumptions 1 (ii) imply that 

(p(z; 0o)-piz; 0e)f < + l{|^|>i)e^^®«+‘+^^^) CG^’^(s). 

The obtained inequalities and integrability conditions (27) and (30) prove the 
statement. 

• Consider the MEMM and 0 G {0o, 0g}. We have 



because of (iii) in Assumptions 1 . The latter assumption and the MVT imply that 
(1 - Piz-. 0))^ = (e' - 



We determine via the MVT and properties (ii) and (iii) in Assumptions 1 for 
k G {2, 4} that 

(p(z; 0o) - piz\ 0e)f 

^ gJ:0o(e^-l) _ g(0,-0o)(e^-l) 

= - 0ef 
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From (28) and (30) we conclude that (v), (vi), and (vii) in Assumptions 1 are in 
force. 

• For the MMM we have 

P^{z- 0 ) = (0(e" - 1) - 1)"^ < C(e^" + 1). 



Moreover it holds that 

(1 - p{z\ 0)f = (e^ - 1)^02 < (l(|,|<ijeV + l{|z|>i)(e2" + d) C. 

We get through (ii) and (iii) in Assumptions 1 that 

(p(z; 0o) - p(z; 0.))'^ = (e^ - l)^(0o - &ef 

- (l{|z|<i)S^^^ + + 1)) C&^ie), 

for k G {2, 4}. The proof is completed by involving conditions (29) and (30). □ 



3.2 Robustness of the BSDEJ 



The aim of this subsection is to study the robustness of the BSDEJs (11) and (25). 
First, we prove the -boundedness of the solution of the BSDEJ ( 1 1 ) in the following 

lemma. 

Lemma 1 Assume point (vi) from Assumptions 1. Let , A ^^ , be the solu- 
tion of (11). Then we have for all t e [0, T] 



E 



u: 






rT 



+ E 



I 



(Af»)2d^ 



+ E 



r f (Bf°(z)fi(dz)ds 

Jt JRq 



< CE[7/^] 



where C represents a positive constant. 

Proof Via (5) we rewrite the BSDEJ (11) as follows 



di>®o = (-b&aAf^ + [ 

V Jm.q 



(z)(l-p(z;0o))£(dz) dr 



+ Af<^dWt + 



[ Bf\z)N(dt,dz). 

i/Mn 



We apply the Ito formula to and find that 
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= + 2e^'y®“ (-b@oAf<> + [ - p{z; ©o))£(dz) ) df 

\ ^Ro / 

+ 2&(^‘vf°Af°dW, +&^‘(Af°fdt 

+ [ ef^‘ + Bf°(z)Y -(V,^°)AN(dt,dz)+ f ef^\Bf°(z)fl(dz)dt. 

Jrq \ ^ ^ / Jrq 

By integration and taking the expectation we recover that 

E p'(y®«)2 

= E p^(yf«)2] - PE ef^^ivf^fds 

- 2E 1^^ e^"y®o + jT Bf%z)(l - p{z; 0o))£(dz)^ d^ j 

-E [ e^^(Afo)2d^l -E [ [ ef^^(Bfo(z)fl(dz)ds 

-J t j _Jt Jrq 

Because of the properties 



(31) 



for all (2, Z? G M and k g M.t it holds that zb lab < ka^ + -b^ (32) 

k 



and 



for all ^ G N and for all <2/ g M, / = 1 , . . . , ^ we have that 

the third term in the right hand side of (31) is estimated by 
r rT 



( n \ ^ n 

,=i / i=i 



(33) 



5f»(z)(l-p(z;0o))^(dz)|dj 



)' 



< E 



< ^E 



2 



: [ (-Z?(9oAfo + / 

Jt \ JRq 

+ i (^-Z>0oAf» + sf»(z)(l - P(z; 0o))^(dz)^ 
e^*(y®»)^d^j + ^Z7^0 o^E jI" e^"(Af»)2d^j 

/ (l-p(z;0o))"€(dz)Erre^^ / 

J Rq -'Z ? 'Z Rq 



ds 



(Bf^(z)fl(dz)ds 



Substituting the latter inequality in (31) leads to 
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i^~lL (l-^(2’®0))^^(dz) 



n^- 

< E 



hiy‘L 



(Bf^{z))^£{dz)ds 



Let k guarantee that 
2 



1 - yb^&o^ > ^ and 1 - 7 / (1 - p(z; &o)fmz) > 7. 

k jRn 2 



(34) 



Hence we choose 



k>4max (b^0o^, / (1 - p(z; 6»o))^€(dz) ) > 0, 

V JRo / 



which exists because of (vi) from Assumptions 1. Besides we assume that > 
k + ^ >0. Then for 5’ G [0, T] it follows that 1 < < e^^ and from (34) we 

achieve 



E / (Vfo)^di +E / (Afo)2di +E // (Bf»(z))2£(dz)di < CE[(V®“)2] 

Jt ] \_Jt J \_Jt Jrq J 



r rT 



+ E f ( 



which proves the claim. 



□ 



In order to study the robustness of the BSDEJs (11) and (25), we consider bc^h 
models under the enlarged filtration F since we have for all r G [0, T] that C ^t- 
Let us define 

y« = y®0 _ yOe ^ ^ ^00 _ ^0e ^ = B®0(z) - l(|^|>s)B®^(z). 



We derive from (5), (11), (19), and (25) that 



dy/ = afdr + AfdWf + 



/ Bf 

JRo 



(z)N{dt,dz)-CpdWt, 



(35) 



where 

a* = -B(0 qA®o - 0jA®O + G(e)0jC®' (36) 

+ [ (B®«(Z) (1 - P(z; 0o)) - l(|,|>,)B®^(z) (1 - P{z; 0,))) l{dz). 

JRq 
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The process (36) plays a crucial role in the study of the robustness of the 
BSDEJ. In the following lemma we state an upper bound for this process in terms of 
the solutions of the BSDEJs. 

Lemma 2 Let Assumptions 1 hold true. Consider as defined in (36). For any 
t G [0, r] and G M /Y holds that 



E 



<c(g'^{s)\e f e^^iAfofds +E / {Bf°{z)fl{dz)ds 

V I LJt J tJt Jm.0 



+ E 



+ E 



+ E 

T 



[ e^^{A^^fds]+E \ [ / {Bl(z)fl(dz)ds 

Jt J Ut JRo 



where C is a positive constant. 

Proof Parts (ii) and (iii) of Assumptions 1 imply that 

\-b (0oAfo - 0.Af')| < |fo||0o - 6>.||Af«| + \b\\&e\\Af^ - Af 
<CG2(e)|Afo|+C|A!| 



and 

|G(e)0,cf*| < C|Cfl. 



Erom Holder’s inequality and Assumptions 1 (vi) and (vii) it follows that 
[ (BfHz) (1 - p{z; 00)) - l{\z\>e]Bf^{z) (1 - P{z\ 0.))) l(dz) 

JRo 



f 

JRc 



Bf\z) {p{z\&o)- p{z\&e))l{dz) 



{p{z;&o)-p(z;&e)fmz)y iBf%z)}h(dz)^ 

h (1 - p{z; 0e)fmz)^ ' {Bt{z)fl{dz^ ' 

\i/2 / [ N.1/: 

j +c\^f(Bliz))h(dz)j 



+ 



/ 

jRr 



BUz)(l-p(z-,0e))mz) 



< CGHs) ( f (Bf0(z))h(dz) 
\Jr 
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We conclude that 






C (g\s) I (Af0)2 + f 

\ J'Rq 



+ {Aif + / {Bi{z)Ytm + (cro 



/ 

JM 



f-f) 



The statement is easily deduced from this inequality. □ 

With these two lemmas ready for use, we state and prove the main result of this 
subsection which is the robustness of the BSDEJs for the discounted portfolio value 
process of the RM strategy. 

Theorem 1 Assumptions 1 are in force. Let be the solution of {\\) 

and be the solution of (25). For some positive constant C 

and any t we have 



r fT 1 

J (Vfo - vfY^ds 


+ E 


J (Afo - Afo^ds 


+ E r / V (BfHz) 
\_J t J Mq 


- l{\z\>e]BfYz)f i{dz)ds 



< C (e [{Ht - + G\s)E[H^]^ . 



+ E 




(CfO^d^ 



Proof We apply the Ito formula to e^^(V/)^ 



d + 2e^V/afdr + 2e^V/AfdW, - 

+ e^'(Af)2dr + e^'(cf')2df + [ e^\B^{z)f£{dz)dt 

JM.Q 

+ [ ({Vf_ + - (V/_)2) N{dt, dz). 

JRq ^ ^ 



Integration over the interval [t,T] and taking the expectation under P results into 






E 



^~^^fds 



-2E 






- E 

- E 






\fj 

_Jt JRa 



-E 



&f^YBUz)fl{dz)ds 



e^^(Cf0^ds , 
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or equivalently 



E 



T 



+ E 






+ E 
= E 
< E 



LJt 

e^^(Bliz)fl(dz)ds 

r rT 



+ E 



^ e^^(Cf0^ds 



r rT 



s^^y/ajds 

r rT 



f I 

Jt 

p^(y^)2]-/6E e^"(y/)^d^ -2E ^ 

p^(y^)2] +(A:-;6)E e^^(y/)2ds +\^ e^Halfds , (37) 



where we used property (32). The combination of (37) with Lemma 2 provides 
E [e^'(y/)2] + (^ - k)E e^"(y/)2di j + E e^"(Af)2d^ j 



[/^'" / 



+ 

< E 
+ E 



^f^(z)fi(dz)ds I + 






+ jG\s) 
k 



e^"(Afo)^ 






(38) 



{Bf^iz)fi{dz)ds 



Let us choose k and P such that 1 2 and P — k >^. This means we choose 

k >2C > Oand)0 > ^ k > 0. Thus for any 5’ G [t, T] it holds that 1 < e^'^ < e^^. 
We derive from (38) that 



- rT 

E • 



+ E 
< C 



(y/)M^ 

77' 

t JRi 



+ E 



I 



(A^^di 



(Bf‘(z)f£(dz)ds 



+ E 



(cfo^dj 



(^E[(y77 + G7e) E ^ (Af0)2d^j+E ^ ^ (fifo(z))7(dz)d^ j 



By Lemma 1 we conclude the proof. □ 

This main result leads to the following theorem concerning the robustness of the 
discounted portfolio value process of the RM strategy. 

Theorem 2 Assume Assumptions 1. Let be part of the solution of (11), 

(25) respectively. Then we have 
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E 



(y©0 _ Y&sf 



sup 

o<t<r 



< C (E[(Hr - + G^(6 )E[h2]) 



for a positive constant C. 

Proof Integration of the BSDEJ (35) results into 

r»r pT pT 



v; = Vf 



f aids - j - f j 

Jt Jt Jt JMc 

By property (33) we arrive at 



Bl{z)N{ds,dz)+ Cf^dW, 



/ 



E 



sup (P/)2 

o<t<r 



< 5 



/ r o"i 






El(r|)2| + E 


Jo 


+ E 



sup 

o<t<r 






+ E 



sup 

o<t<r 



{[ L “ 9 ] ^ ^ isir {f 



Burkholder’s inequality (see e.g., Theorem 3.28 in [15]) guarantees the existence of 
a positive constant C such that 



E 



E 



E 



sup 

o<t<r 



sup 

o<t<r 



,S!rU,‘ 

(T [ Bl{z)N{ds,dz)) 
\J t J Mo / 

{[cf.,w) 



(Af)2ds 




1 r 


' fT f 


< CE 


/ / (Bj(z))2£(dz)d^ 


J L 


. J 0 J Mq 



cf« dir, I < CE / (Cfo^ds 



Application of Lemma 2 for r = 0, ^0 = 0, Lemma 1, and Theorem 1 completes the 
proof. □ 



3.3 Robustness of the Risk-Minimising Strategy 

Theorem 2 in the previous subsection concerns the robustness result of the value 
process of the discounted portfolio in the RM strategy. Before we present the stability 
of the amount of wealth in the RM strategy, we study the relation between 
(resp. 71^0 and the solution of the BSDEJ of type (11) (resp. (25)) in the first 
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(resp. second) model. Consider the processes and defined in (12), then 

it holds that 

- l)/o(z; 0o)Udz) 

= + X^^b + [ (;r®o(e" - p{z', @o) + y®°(z)(e" - l)p(z; &o)) l(dz) 

Jro 



= 7T 



00 



b^+ / (e^ - l)V(z; 0o)€(dz) 



+ + [ F®o(z)(e^ - l)p(z; 0o)€(dz). 

Jro 



/Rq 

From property (10) we attain that 

>00 



7T 



— [A'^^b + f B^\zW - l)p(z; 0o)^(dz) I , 
'^o V Jro / 



(39) 



where kq = b^ + /iRu(e^ — \)^p{z\ 0o)^(dz). Similarly for the second setting we 
have for the processes and defined in (26) that 



A®'h + 


J{\z\>s} 


B^^izW 


= 7T ^ • 


b^ + 


f (e^ 






'{|z|>«) 


+ X®'h + 1 


f y0£ 









Property (24) leads to 

if®' = - [A^^b + [ B®'(z)(e" - l)p(z; 0,)£(dz) + C®'G(e) ) , (40) 

V J{\z\>s] / 

where = b^ + /(|z|>£)(e^ ~ &s)^{dz) + G^(e). 

We introduce the following additional assumption on the Levy measure which we 
need for the robustness results studied later. 

Assumption 2 For the Levy measure I the following integrability condition holds 
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Note that the latter assumption, combined with (30), implies for k e {2, 4} that 



/ 

JR( 



(e"-lf€(dz) 



V-'ii 



z^l(dz) + 



{kl<i} 



[ i(dz) + [ c^Hidz)) 

J{\z\>l} J{z>l} ) 



< 00 . 



(41) 

Moreover Assumption 2 is fulfilled for the considered martingale measures 
described in Sect. 3.1. Indeed, consider the ET, applying (27) for 0 = 4 and restrict- 
ing the integral over {z > 1} implies Assumption 2. On the set {z > 1} it holds that 
z < e^ — 1 and therefore Assumption 2 follows from (28) by choosing 0=4. For 
the MMM, condition (29) corresponds exactly to Assumption 2. 



Theorem 3 Impose Assumptions 1 and 2. Let the processes and denote 
the amounts of wealth in a RM strategy. There is a positive constant C such that for 
any t G [0, T] 



E 




< C (e[(Ht 






Proof Consider the amounts of wealth in (39) and (40). Let us denote 
and = — T^. Then it holds that 

K'S 



(^00 _ < 2 ( r °)2 + 3 -( r ° - . 

yV KOKe / Kj j 

Herein we have because of the Holder’s inequality, (14), (41), and properties (iv) and 
(vii) in Assumptions 1 that 



a 



+ 

3 



(e^ - ifpiz; 0o)l(dz) 



(e^ - \f(p(z\ 0o) - p{z\ ©e)mdz) I + G\e) 






(e^ - lf£(dz) 



(d/ 

[ (e^ - iVudz) f 

J Mn J M 



+ / (e^ - irudz) / (P(z; ©o) - P(z; ©,))" i(dz) + G\e) 



< CG^^ie). 
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On the other hand it is clear from (39) and (40) that 



^^0 'Y'S '^2 



< 3 



+ ( [ (B®»( 2 )(e^ - l)p{z\ 0o) - lj|^|>,)B®nz)(e^ - l)p{z\ 0,))€(dz) 



)> 



Herein we derive via Holder’s inequality, (30), (41), and points (iv), (v), and (vii) in 
Assumptions 1 that 



(/< 

w Mo 






(fi®o(z)(e^ - l)p(z; 0o) - l(|,|>.)fi'=''(z)(e^ - l)p(z; 0.))£(dz) 

= ( [ (B^o{z)(p(z; 0o) - P(z; 0,))(e^ - 1) + B®(z)p(z; 0.)(e^ - l))€(dz)) 

VMo / 

/ 

JM.0 



< f (B®o(z))2£(dz) / (p(z; 0o) - P(z; 0,))^(e^ - l)2£(dz) 

*/Mo 



[ (B^(z)fl(dz) [ 

J Mo J M 



+ / (fi®(z))^£(dz) / p^(z; 0.)(e^ - l)^£(dz) 

Mo 

< [ (B^Hz)fmz) ( [ (p(z; 0o) - P(z; 0,))^€(dz) 

Jm.() 



I 

Jm.0 



+ 1 (BHz)fmz) 



a 

a 



)’(/. 



(e^ - l)^€(dz) 



P^(z; 0,)£(dz) f (e^ - l)^£(dz) ' 
{|zl>D •^(lzl>i) J 



+ c [ (B^z)fmz) [ z^mz) 

Jko >/{kl<i) 

<CG\s) I (fi®o(z))^£(dz) + C / (B‘^(z)fi(dz). 

Jro 



^Mo 



] 



The results above show that 



(rtfo - < C ^ (Bf 



‘^(z))h(dz) + (Cf^f 



+ G\e) 



(Af°f + 



f (Bf«(z))^€(dz)!) 
jRo \J 
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Therefore 



[/ 



E| / 

rT 






+ G\s) 



E 



+ E 

■T 



1/7 

Jt JR( 



(BHz))h{dz)ds 



+ E 



J\cf^fds'j 



^ (Afo)2d^ +E ^ l^(BfHz))h(dz)dsJ^y 



By Lemma 1 and Theorem 1 we conclude the proof. □ 

The trading in the risky assets is gathered in the gain processes defined by Gf ^ = 
/o and Gf^ = The following theorem shows the robustness of 

this gain process. 

Theorem 4 Under Assumptions 1 and 2, there exists a positive constant C such that 
for any t e [0, T] 



E 




< C (e[(777- 



H^f] + G^{s)E[H^]^ . 



Proof From (5) and (6) we know that 



^f^dSs = ^f^SsbdWf^ + [ (e" - l)Af®o(d^, dz) 

JRo 






S, dz)^ . 






-b^&o + [ (e^ - 1) (1 - p{z; 0o)) ^(dz) | d^ 

JRq 

-\-bdWs-\- (e^-l)A^(d^, 

JRo 



In the other setting we have from (19) and (20) that 



^f^dst = ^f^s^bdwf^ + [ 

J{\z\>s} 



((-b^&e + 


/ (e^ 


w 




+ bdWs + 


/ (e" 







(e^ - l)Ar®«(ds, dz) + ^f"7®G(e)dWf' 
1) (1 - p(z; 0,))£(dz) - G^(s)&e^ d^ 
l)Af(d5, dz) + G(e)dwd . 
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We derive from the previous SDEs that 

Gf« - Gf‘ = f - f 

Jo Jo 



= (-b^@0 + [ (e^ - 1) (1 - P(z; ©o))^(dz)) [ itf^ds 
\ JRo ) Jo 

- (-b^0e + [ (e^ - 1) (1 - P(z; @e)) l(dz) - G\e)i 

\ J[\z\>s] 

+ b f (;rf» - TrfOdW, + f f (;rf»(e^ - 1) - ;rf' l(|,|>,,(e^ - 1)) N(ds, dz) 
Jo Jo JRq 

- g(£) f‘ a f^dWs- 

Jo 

Via the Cauchy-Schwartz inequality and the Ito isometry we obtain that 
E - Gf^)^j 



(-b^&e+ [ 

V J{\z 



b^0o + (e^ - 1) (1 - P(z; ©o)) mz) 



) 



- I -b^@s + (e^ - 1) (1 - p{v, &e)) mz) - G^(S)0, 

Mlz|>e) 



)F 



+ E 

(-b^0e+ f 
V Jdz 



X I -b^e + I (e" - 1) (1 - Piz; 0e)) mz) - G\s)0. 

>{\z\>s] 






+ b^El / (irf 0 - ;rf o^ds 



+ G^(e)E 



[/' 






+ E 

wherein 

E 



lo Ir “ 1) “ - l))^^(dz)dsj^ , 



< 2E 

< 2 



/ / - 1) - jzf'l(|j|>s)(e^ - l))^£(dz)dJ 

Jo J Mo J 

/ / ((3zf'>)^(e^ - l)^l{|2|<s) + (^f“ - - l)^l(|z|>s)) €(dz)di 

Jo J Mq 

£(;rfo)2d^] 



U 

L 



(e" - l)^^(dz)E 



(|z|<f^) 



+ / (e" - l)^£(dz)E 



J‘(nfo - , 
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and 



E 


[ (nf^fds 


< 2E 


[ (jcf^ — jxf^)^&s 


+ 2E 


finfofds 




Jo 




Jo 




Jo 



By relation (14), Assumptions 1, (39), (41), Lemma 1, and Theorem 3 we prove the 
statement. □ 

The following result shows the robustness of the process appearing in the 
GKW-decomposition. This plays an important role in the stability of the cost process 
of the RM strategy. 

Theorem 5 Let Assumptions 1 and 2 hold true. Let the processes and 
be as in (9) and (23), respectively. For any t G [0, T] it holds that 

< C (e[(777’ - Hrf] + G^(s)E[H^]^ , 

for a positive constant C. 

Proof By (5) we can rewrite (9) as 

d^0o ^ (-be^xf^ + [ F®«(z)(l - p(z; 0o))£(dz)) dr 

V Jro / 

+ xf°dWt+[ F®«(z)Af(dr,dz). 

JRq 

and similarly by (19) we obtain for (23) 

d^®' = (-bOeXf^ + [ Ff'(z)(l - p{z; 0,))£(dz) - G(e)0,zf4 dr 
V Ji\z\>s] / 

+ xf^dW,+ [ Yf^iz)N(dt,dz) + zf^dWt. 

Ji\z\>s] 



Hence we recover that 

d(^®“ - = F/d? + XfdWt + f Yf(z)N(dt, dz) - zf^dWt, 

JRq 



where 

+ G(e)0^Z®® 

+ [ (F®«(z)(l - p{z; 0o)) - l(|,|>.)F®n^)(l - Piz-, &e))) Udz), 

Jm.q 

r(2) = F®o(z)-l(|^|>,)F®^(z). 
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By integration over [0, r] and taking the square we retrieve using (33) that 

2 



< c 



^(f [ +(f 

\J0 JRn / V-/0 



zf^dW, 



Y^iz)N(ds 

Via the Cauchy-Schwartz inequality and the Ito isometry it follows that 



)> 



E[(^®« - < C 



(y/)"d^ 



/ 

Uo 



+ E / (V)^ds 



+ E 



\fl 

Uo JRi 



(Zf(z))2£(d2)d^ 



+ E 



I/^]) 



Concerning the term E (y/)^d5'j we derive through (ii) and (hi) in Assumptions 



1 that 



Ey\&oXf^-0,xf^fds 

< 2 ^E [^'(00 - &ef(Xf<^fds 

< C ^G"^(e)E 



(Zfo)^ds 



J‘&eHxfo 



■ zfo^d^ 



]) 



and via (vi) and (vii) in Assumptions 1 it follows that 

f 1 / (1 - Piz-, 0o)) - l{|z|>.)Zfn2) (1 - Piz-, 0s))] mz) 

Jo I J Mo 

[ (Piz; 0o) - Piz; 0s)f HdzW \ f [ (rf'^izifudzids 

J Mq -Jo J Mq 

f [ il - Piz; 0s)f tidz)E\ f [ 

Jrq Uo Jr 






(Ff(z))2£(dz)d^ 



<c(G4(e)Er/ [ iYf^iz)fUdz)ds]+E\ [ f (F/(z))2£(dz)ds]) . 

V Uo Jrq j L^o Jrq J/ 
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Thus we obtain that 






<c(g‘^(s)\e f (Zfo)2d^ +E f [ (Yf<>iz)fi(dz)ds 

V I Lio J Lio Jm.0 



+ Er [\xlfds]+E\ f [ 
Uo J Uo Jm 



(F5(z))2£(dz)ds 



+ E 



(ZfO^ds 



L70 



]) 



(42) 



Let us consider the terms appearing in the latter expression separately. 
• Definition (12) implies that 



E 



J^ixf^fds <2^^j^{Af^fds +b^E j^iTtf^fds^ 



and 

E 
< 2 



r f f (Yf\z)fl{dz)ds 
L70 7Mo 

("ill 



{Bf^{z)?l{dz)ds 



/ 



+ / (e" - l)2£(dz)E 






Combining (12) and (26) in 



zf = zf® - zf^ = if - (irf® - Tif‘)b, 



it easily follows that 
L7o 

Similarly, from (12) and (26) we find 



(Xf)2ds < C ^E (Af)2ds + E (:rfo - :rf0^ds . 



Yf(z) = Yf\z) - Yf^iz) = B^(z) - (:rf« - :rfO(e^ - D- 



Hence 

E 



(f/(z))2£(dz)d^ 






(S!(z))2£(dz)d5 



+ 



[ (e^ - lf£(dz)E \ - irfO^dd) 

7ro Uo J/ 
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From (26), the estimate 

< c ((Cf^)2 + (7rf« - 



leads to 

e\ [\zf Hz) fds 
Jo 



< C 



HI! 



{CfH'^As 



+ G^(e)E 






+ G^(e)E| / {nf^ -TtfH'^ds 



Because of (39) and (vi) in Assumptions 1 we notice that 
rt 



E [ / (;rf»)2dd < C (e T / (Afo)2dd + E [ / / (Bfo(z))2£(dz)dsl) 

Jo j \ Uo J Uo Jro J/ 

Using (41) and the combination of the above inequalities in (42) show that 

E[(^®“ < C (g2(£) [e / (Af0)2di +E [ [ (Bf0(z))2£(dz)di 

V I UO J L^o Jrq 



+ E I / (Af)2d£ 



/ 

Uo 



+ E / (Cf')2di 



H- IE 
+ E 



[/'/ 

Uo Jr 

[f 



(Bl(z))H(dz)ds 



(7tf° - Tifn^ds 



] 

'])■ 



Finally by Lemma 1 and Theorems 1 and 3 we conclude the proof. 



□ 



The cost processes of the quadratic hedging strategy for Ht , are defined by 

K^Q, — ^00 y^0o The upcoming result concerns the 

robustness of the cost process and follows directly from the previous theorem. 

Corollary 1 Under Assumptions 1 and 2, there exists a positive constant C such 
that it holds for allt that 

E[(/:f'o - < C (e[{Ht - Hj)^] + G^(e)E[H^]^ . 

Proof Notice that 



E[(/s:f'’ - Kf‘)^] < 2 



(e[(^®« 



- + E[(Vn®° - V^‘ 



)^]) 
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wherein 

sup . 

0<t<T 

Theorems 2 and 5 complete the proof. □ 



E[(V(fo - < E 



3.4 Robustness Results for the Mean-Variance Hedging 

Since the optimal numbers and of risky assets are the same in the RM and 
the MVH strategy, the amounts of wealth ^nd and the gain processes 
and also coincide for both strategies. Therefore we conclude that the robustness 
results of the amount of wealth and gain process also hold true for the MVH strategy, 
see Theorems 3 and 4. 

The cost for a MVH strategy is not the same as^for the RM strategy. However, 
under the assumption that a fixed starting amount Vq is available to set up a MVH 
strategy, we derive a robustness result for the loss at time of maturity. For the models 
(1) and (13), it holds that the losses at time of maturity T are given by 

L®o = Hr-Vo-/ 

Jo 

= H^-Vo- [ 

Jo 

When Assumptions 1 and 2 are imposed, we derive via Theorem 4 that 
E[(L®o - < C (e[(Ht - H^f] + , 

for a positive constant C. 

Note that we cannot draw any conclusions from the results above about the robust- 
ness of the value of the discounted portfolio for the MVH strategy, since the portfolios 
are strictly different for both strategies. 



4 Conclusion 

Two different geometric Levy stock price models were considered in this paper. We 
proved that the RM and the MVH strategies in a martingale setting are stable against 
the choice of the model. To this end the two models were considered under different 
risk-neutral measures that are dependent on the specific price models. The robustness 
results are derived through the use of BSDEJs and the obtained -convergence rates 
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are expressed in terms of estimates of the form E[(Ht — H ^)^] . The latter estimate is 
a well studied quantity, see [3, 16]. In the current paper, we considered two possible 
models for the price process. Starting from the initial model (1) other models could 
be constructed by truncating the small jumps and possibly rescaling the original 
Brownian motion (cfr. [8]). Similar robustness results hold for quadratic hedging 
strategies in a martingale setting in these other models. 

In [8] a semimartingale setting was considered and conditions had to be imposed 
to guarantee the existence of the solutions to the BSDEJs. In this paper however, 
we considered a martingale setting and, since there is no driver in the BSDEJs, 
the existence of the solution to the BSDEJs was immediately guaranteed. On the 
other hand, since the two models were considered under two different martingale 
measures, we had to fall back on the common historical measure for the robustness 
study. Therefore, a robustness study of the martingale measures had to be performed 
and additional terms made some computations more involved compared to the semi- 
martingale setting studied in [8]. 

In this approach based on BSDEJs we could not find explicit robustness results for 
the optimal number of risky assets. Therefore we refer to [6] , where a robustness study 
is performed in a martingale and semimartingale setting based on Eourier transforms. 
Note that in [6] robustness was mainly studied in the -sense and the authors noted 
that their results can be extended into -convergence, whereas L^-robustness results 
are explicitly derived in the current paper. 
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Risk- Sensitive Mean-Field Type Control 
Under Partial Observation 

Boualem Djehiche and Hamidou Tembine 



Abstract We establish a stochastic maximum principle (SMP) for control problems 
of partially observed diffusions of mean-held type with risk- sensitive performance 
functionals. 

Keywords Time inconsistent control • Maximum principle • Mean-held SDE • 
Risk- sensitive control • Partial observation 

AMS subject classification: 93E20 • 60H30 • 60H10 • 91B28. 



1 Introduction 

In optimal control problems for diffusions of mean-held type the performance func- 
tional, drift and diffusion coefficient depend not only on the state and the control but 
also on the probability distribution of the state-control pair. The mean-held coupling 
makes the control problem time-inconsistent in the sense that the Bellman Princi- 
ple is no longer valid, which motivates the use of the stochastic maximum (SMP) 
approach to solve this type of optimal control problems instead of trying extensions of 
the dynamic programming principle (DPP). This class of control problems has been 
studied by many authors including [1, 2, 5, 7, 15, 20]. The performance functionals 
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considered in these papers have been of risk-neutral type i.e. the running cost/profit 
terms are expected values of stage-additive payoff functions. Not all behavior, how- 
ever, can be captured by risk-neutral performance. One way of capturing risk-averse 
and risk- seeking behaviors is by exponentiating the performance functional before 
expectation (see [17]). 

The first paper that we are aware of and which deals with risk- sensitive optimal 
control in a mean field context is [24]. Using a matching argument, the authors 
derive a verification theorem for a risk-sensitive mean-field game whose underlying 
dynamics is a Markov diffusion. This matching argument freezes the mean-field 
coupling in the dynamics, which yields a standard risk-sensitive HJB equation for 
the value-function. The mean-field coupling is then retrieved through the Fokker- 
Planck equation satisfied by the marginal law of the optimal state. 

In a recent paper [11], the authors have established a risk-sensitive SMP for 
mean-field type control. The risk-sensitive control problem was first reformulated 
in terms of an augmented state process and terminal payoff problem. An intermedi- 
ate stochastic maximum principle was then obtained by applying the SMP of ([5], 
Theorem 2.1.) for loss functionals without running cost but with augmented state 
in higher dimension and complete observation of the state. Then, the intermediate 
first- and second-order adjoint processes are transformed into a simpler form using 
a logarithmic transformation derived in [12]. 

Optimal control of partially observed diffusions (without mean-field coupling) has 
been studied by many authors including the non-exhaustive references [3, 4, 8-10, 
13, 14, 16, 19, 21, 23, 26, 27], using both the DPP and SMP approaches. Reference 
[23] derives an SMP for the most general model of optimal control of partially 
observed diffusions under risk-neutral performance functionals. Recently, Wang et al. 
[25], extended the SMP for partially observable optimal control of diffusions for risk- 
neutral performance functionals of mean-field type. 

The purpose of this paper is to establish a stochastic maximum principle for a 
class of risk-sensitive mean-field type control problems under partial observation. 
Following the above mentioned papers of optimal control under partial observation, 
in particular [23], our strategy is to transform the partially observable control problem 
into a completely observable one and then apply the approach suggested in [11] to 
derive a suitable risk-sensitive SMP. To the best to our knowledge, the risk-sensitive 
maximum principle under partial observation without passing through the DPP, and 
in particular, for mean-field type controls was not established in earlier works. 

The paper is organized as follows. In Sect. 2, we present the model and state the 
partially observable risk- sensitive SMP which constitutes the main result, whose 
proof is given in Sect. 3. Finally, in Sect. 4, we apply the risk-sensitive SMP to 
the linear-exponential-quadratic setup under partial observation. To streamline the 
presentation, we only consider the one-dimensional case. The extension to the mul- 
tidimensional case is by now straightforward. Furthermore, we consider diffusion 
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models where the control enters only the drift coefficient, which leads to an SMP 
with only one pair of adjoint processes. The general Peng-type SMP can be derived 
following e.g. [11, 23]. 



2 Statement of the Problem 

Letr > 0 be a fixed time horizon and (^2, F, P) be a given filtered probability 
space on which there are defined two independent standard one-dimensional Brown- 
ian motions W = and Y = Let and be the P-completed 

natural filtrations generated by W and Y, respectively. Set := 0 < 5' < T} 

and F := 0 < 5’ < T}, where, v . 

We consider a mean-field type version the stochastic controlled system with partial 
observation considered in [23] which is an extension of the model considered by 
[4, 14] to which we refer for further details. 

The model is defined as follows. 

(/) An admissible control u is an F^ -adapted process with values in a non-empty 
subset (not necessarily convex) f/ of R and satisfies E[Jq \u(t)\^dt] < oo. We 
denote the set of all admissible controls by The control u is called partially 
observable. 

(//) Given a control process u e "2^, we consider the signal-observation pair 
(x“, T) which satisfies the following SDE of mean-field type 

' dx^(t) = b(t, E[x^(t)l u{t))dt -h a{t, x^it), E[x^{t)])dWt 

+ a{t,x^{t),E\_x^{t)])dWf, v“(0)=vo, (1) 

dYt = ^{t,x^{t))dt + dWf, Fo = 0, 

where, 

b{t, x,m,u) \ [0, T] X R X R X f/ — ^ R, 

a(t, X, m), cf(t, X, m) \ [0, T] x R x R — ^ R 

and P(t,x) : [0, T] x R — ^ R are Borel measurable function. 

In this model, the observation process T, which carries out the controls u, is assumed 
to be a given Brownian motion independent of W and is supposed to admit a decom- 
position as a trend Jq ^(t, x^(t))dt (a functional of the state process x^) corrupted 
by a process which are a priori not observable. The case Ck: = 0 corresponds to 
the model considered in [4, 14]. A more general model of the function p would be 
to let it depend on the control u and be of mean-field type. To keep the presentation 
simpler, we skip this case in this paper. But, the main results do extend to this case. 
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Before we formulate the control problem, we show that the system (1) has a weak 
solution. Introduce the density process defined on (f2, F, P) by 



I 



P(s,x^(s))dY,- 
0 ^ 



-f 

2 Jo 



\P(s,x^{s))\^ds 



p (t) := exp 

which solves the linear SDE 

dp^t) = p^(t)P(t, x^(t))dYt, p"(0) = 1 



Assuming the function p bounded (see Assumption 1, below), p is a uniformly 
integrable martingale such that, for every p > 2, 



E[ sup pf ] < C, 

0<t<T 



( 2 ) 



where, C is a constant which depends only on the bound of ^0, p and T . Define 
= p^{T)dV. By Girsanov’s Theorem, P“ is a probability measure. Moreover, 
is a P“ -standard Brownian motion independent of W . This in turn entails that 
(P“, 7, W, W^) is a weak solution of (1). 

The objective is to characterize admissible controls which minimize the risk- 
sensitive cost functional given by 

J\u(-)) = E“ 

where, 0 is the risk-sensitivity index, 

f(t, x,m,u) \ [0, T] X R X R X f/ — R, 
h(x, m) \ R X R — ^ R. 

Any fi(-) G ^ which satisfies 

J\m)= inf (3) 

is called a risk- sensitive optimal control under partial observation. 

Let f(t,x(t), E^[x(t)], u(t))dt -h h{x(T), E^[x(T)]) and consider 

the payoff functional given by 



exp ( 6» / fit, x^it), E^[x^{t)\ u(t))dt -h h(x^(T), E^[x^(T)]) 



])] 



:= -\ogE"e^'^^. 

U 

When the risk-sensitive index 6 is small, the loss functional can be expanded as 



+ ^var„(*f'r) + 0(02), 
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where, ydiVui^r) denotes the variance of w.r.t. P“. If 0 < 0 , the variance of 
as a measure of risk, improves the performance , in which case the optimizer is 
called risk seeker. But, when 0 > 0, the variance of worsens the performance 
^ 0 , in which case the optimizer is called risk averse. The risk-neutral loss functional 
E^[^t] can be seen as a limit of risk- sensitive functional when 0 ^ 0 . 

Since = p^(T)dV, the associated risk-sensitive cost functional becomes 



r(u(-)) = E 



f(t.x-(t),E[p-(t)x-mu(t))dt+h(x-(T),E\_p"(T)x"(T)])] 



where, on (k2, F, P), the process (p“, satisfies the following dynamics: 



(4) 



dp^(t) = p^(t)P(s,x^(s))dYt, 

dx^(t) = {b(t, x^(t), E[x^(t)l u(t)) - a(t, x^(t), E[x^(t)])P(t, x^(t))}dt 
+ a(t, x^(t), E[x^(t)])dWt + ci(t, E[x^(t)])dYt, 
p"(0) = 1, v"(0)=vo. 

(5) 



We have recast the partially observable control problem into the following completely 
observable control problem: Minimize J^(u(-)) defined by (4) subject to (5). 

The main result of this paper is a stochastic maximum principle (SMP) in terms of 
necessary optimality conditions for the problem (3) subject to (5). 

We will only consider the case where the risk- sensitive parameter is positive, 0 > 0. 
The case 0 < 0 can be treated in a similar fashion by considering 0 = —0,0 > 0, 
and / = — f, h = —h in the performance functional (4). 

We will make the following assumption. 

Assumption 1 The functions b, a, a, f, h are twice continuously differentiable 
with respect to (x,m). Moreover, these functions and their first derivatives with 
respect to (x,m) are bounded and continuous in (x,m,u). 

To keep the presentation less technical, we impose these assumptions although they 
are restrictive and can be made weaker. 

Under these assumptions, in view of Girsanov’s theorem and [18], Proposition 1.2., 
for each u e the SDE (5) admits a unique weak solution (p“, v“). 

We now state an SMP to characterize optimal controls u(-) e ^ which minimize 
(4), subject to (5). Let (p, x) := (p“, x^) denote the corresponding state process, 
defined as the solution of (5). 
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We introduce the following notation. 



\c{t,x,m,u)J \a(t, X, m) cr(t, X, m) J 

c(t, X, m, u) \= b(t, X, m, u) — a(t, X, m)P(t, x), (p(X):=x, ^(X) := px, 
(p(X) :=x, 0(X) := px. 

( 6 ) 

We define the risk-neutral Hamiltonian as follows. For (p, q) G x R^^^, 

H(t, X, m, p, q, u) := {F(t, X, m, u), p) +tr(G*(^, X, m)q) — f(t, x, m, u), (7) 



where, denotes the transposition operation of a matrix or a vector. 

We also introduce the risk- sensitive Hamiltonian: (p, q, 1) x R2x2 X 

H^(t, X, m, u, p, q, £) := {F(t, X, m, u), p) — f(t, x, m, u) 

+ Xi{G*{t,X,m){q+etp*)). 



We have H = H^. 

Setting 

\h) \P 2 j \q 21 q 22 J 

the explicit form of the Hamiltonian (8) reads 



X, m, u, p, q, i) c(t, x, m, u)p 2 — fit, x, m, u) -h pPit, x)iq\\ -h Oi\p\) 
-h a{t, X, m)iq 2 \ + OI 2 P 1 ) + cr(t, x, m)(q22 + 0l2P2)- 

Setting 0 = 0, we obtain the explicit form of the Hamiltonian (7): 



H(t, X, m, u, p, q) \= c{t, x, m, u)p 2 — fit, x, m, u) -h p^it, x)q\i 
-h ait, x,m)q 2 \ -\- cfit, x,m)q 22 - 

With the obvious notation for the derivatives of the functions b,a, p, a, f,h, w.r.t. 
the arguments v and m, we further set 

' H^it, X, m, u, p, q) \= Cxf, x, m, u)p 2 - fxit, x, m, u) -h pfxO, x)iqn -h 0l\pi) 

-h axit,x,m)iq2\-\-0l2Pi)-\-CTxit,x,m)iq22 + 0l2P2), 

■ H^it, X,m,u, p,q) := Cmit , x , m , u) P 2 + Oimit , x , m)iq 2 \ + 0l2P\) 

-h atnit,x,m)iq22 + 0l2P2), 

H^pit, X, m, u, p, q) := fit, x)iqn -h Olipi). 
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With this notation, the system (5) can be rewritten in the following compact form 

dX{t) = F(t, X(t), £[0(Z(f))], u(t))dt + G(t, X(t), E[<p{X(t))])dBr, 

X(0) = Zo. ^ ^ 

We define the risk-neutral Hamiltonian associated with random variables X such that 
4>(X) and0(Z) are L*(i2, P) as follows (with the obvious abuse of notation): 
For (p, G X R^^^, 

H(t, X, p, q, u) := (F(t, X, ^[0(X)], u), p) - f(t, x, E^X)], u) 

+ tr(G*(cX,^[0(X)])^). ^ ^ 



We also introduce the risk- sensitive Hamiltonian: (p, f) g R^ x R^^^ x R^, 

X, u, p, q, £) := {F(t, X, £[0(Z)], u), p) - f(t, x, E[^{X)], u) 

+ tr{G*(t.X,E[(t>{Xmq+9£p*)). ^ ’ 



For g G {b, c, cr, a, fi} and u e U,we set 

gx(.t) := gx{t,x{t), E[x(t)],u(t)), gm(t) := gm(t,x{t), E[x{t)],u(t)) (12) 



and 



fx(t) ■= fx(t , x{t), E[p(t)x{t)], u(l)), fm{t) := fm{t,x{t),E[p{t)x{t)],u(t)), 
hx{t) := hx{x{t), E{p{t)x{t)]), hm{t) := hm{x{t), E[p(t)x(t)]). 

(13) 

Let 



p(T) QxpO 



r rT 




, x(0, E[p{t)x{t)}, u{t))dt + h{x{T), E[p{T)x{T)}) . 



We introduce the adjoint equations involved in the risk-sensitive SMP for our 
control problem. 



dp{t) = - 









E[v\t)H, 
-|-^(^)( — 0£{t^dt -\-dBf^, 
dv^it) = ev^(t){e(t),dBt), 

^ _/(0p(r))->\ _ /i(r)\ j_ 

V hx(T) ) \p(T))rr 



(t)] 

(t)] - |§T[v^f)/m(0]] 



) 



dt 



(14) 



p(T) 

v^^) = 






(r)], 



where, in view of (2) and (13), for ^ g {p, v}. 
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H^it) := {Fkit, X(l), E[<j)(X(l))], u(l)). Hi)) ~ fkH, m, u{t)) 

+ tr(G|(f, Z(0, E{<t>{X{t))mil) + eip*{t)) 



and 



Hiit) := {Emit, nt), E{<t>{X{t))l m(0), Pit)) 

+ tr(G;(r, Xit), EmX{t))])iq{t) + 9lp*it)). 

We note that the processes (p,q,l) may depend on the sensitivity index 0. To ease 
notation, we omit to make this dependence explicit. 

Below, we will show that, under Assumption 1, (14) admits a unique F-adapted 
solution (p,q,v^ ,i) such that 



sup Ip (01 
?€[0,r] 



+ sup 
?€[0,r] 



\v\t)\^ + 






^ + 1 ^( 01 ^) 



dt 



< 00 . 



(15) 



Moreover, 

Lemma 1 The process defined on (T2, F, P) by 

Ly.= ^^=cxp(^J^e(l{s),dB,)-^ \l(s)\^dsy 0<t<T, (16) 

is a uniformly integrable ¥ -martingale. 

The process defines a new probability measure equivalent to P by setting 
:= ^1^ . By Girsanov’s theorem, the process := Bf — 0 £(s)ds, 0 < 

t < T is a. P^ -Brownian motion. 

The following theorem is the main result of the paper. Let denote the expec- 

tation w.r.t. P^ . 

Theorem 1 (Risk-sensitive maximum principle) Let Assumption 1 hold. If the 
process (pf),xf),uf)) is an optimal solution of the risk-sensitive control problem 
(3)-(5), then there are two pairs of V -adapted processes (v^, i) and (p, q) which 
satisfy ( 14 )-( 15), such that 

E^[H\t, p{t), x(t), pit), q(t), lit), u) - H^it, pit), xit), pf), qf). If), uit))\^f] < 0 , 
for all u e U, almost every t and P^ —almost surely. 

Remark 1 The boundedness assumption of the involved coefficients and their deriv- 
atives imposed in Assumption 1, in Theorem 1, guarantees the solvability of the 
system of forward-backward SDEs (5) and (14). In fact Theorem 1 applies provided 
we can solve system of forward-backward SDEs (5) and (14). A typical example of 
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such a situation is the classical Linear- Quadratic (LQ) control problem (see Sect. 4 
below), in which the involved coefficients are at most quadratic, but not necessarily 
bounded. 



3 Proof of the Main Result 

In this section we give a proof of Theorem 1 here presented in several steps. 



3.1 An Intermediate SMP for Mean-Field Type Control 

In this subsection we first reformulate the risk-sensitive control problem associated 
with (4)-(5) in terms of an augmented state process and terminal payoff problem. 
An intermediate stochastic maximum principle is then obtained by applying the 
SMP obtained in ([1], Theorem 3.1 or [5], Theorem 2.1) for loss functionals without 
running cost. Then, we transform the intermediate first-order adjoint processes to a 
simpler form. The mean-field type control problem for the cost functional (4) under 
the dynamics (5) is equivalent to 

inf E , (17) 

u{-)e^ L J 



subject to 

dp{t) = p{t)^(t,x{t))dYt, 

dx(t) = {b(t, x(t), E[x(t)], u(t)) — a(t, x(t), E[x(t)])l3(t, v(Q)} dt 

+ a(t, x(t), E[x(t)])dWt + oi(t, x(t), E[x(t)])dYt, (18) 

d^ = f{t, x(t), E[p(t)x(t)], u(t))dt, 
p(0) = 1, v(0)=vo, §(0) = 0. 



We introduce the following notation. 

" = 0 “(O’ "=(f )=(!)■ 

X / G(t, X, m, u) \ 

r(t,R,m) :=l ' ’ Q ’ M, 

0(7?) = 0(X) = X, 0(7?) = 0(X) = px, (/)(R) = 0(X) = X, 0(^) = 0(A) = px. 
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With this notation, the system (18) can be rewritten in the following compact form 



dR{t) = 
R{0) = 



\f{t.R{t),EmR{t))lu{t))) 

Ro 



dt + r{t, R{t), E[<t>{R(t))])dB,, 



and the risk-sensitive cost functional (4) becomes 

J\u(:)) := £[<D (^R(T), £[0(7?(r»])], 



where, 

O (R(T), E[(l)(R(Tm := p(T) exp (Oh(x(T), E[p(T)x(T)]) + 0^(T )) . 

We define the Hamiltonian associated with random variables R such that 0(/?) e 
P) as follows. For (p, q) eWi? x 



+ tr(r*(r,/?,^[0(7?)])^), 

where, F* denotes the transpose of the matrix F. 

Setting 

/ Pi\ All ^12 \ 

P = [piF ^ I ^21 qi2 J , ( 20 ) 

\p2) \q3iq32/ 

the explicit form of the Hamiltonian (19) reads 



E[(l>(R(tMu(t))\ 

EmR(t))lu(t)))^^^ (19) 



R, p, q, u) \- 



'E(t, R(t), 
J(t,R(t), 



(t , p , X , ^ , p , q , u) := R, p, q, u) = c(t, x, E[x], u)p2 + fit, x, E[px}, m )/?3 

-\-oit, X, E[x])q22 + P^it, x)qn + a{t, x, E[x])q2\. 



In view of (12), we set 

' Hjit) \= Cjc(t)p 2 it) + fxit)P3it) + Cfxit)q22it) + pit)Pxit)qnit) + O^x(0^2l(0, 
H^it) := Cm(t)p2it) + fm(t)p3it) + CTmit)q 22 it) + Oim(t)q 2 lit), 

. H^piO = Pit,xit))qnit). 



We may apply the SMP for risk-neutral mean-field type control (cf. [1], Theorem 
3.1 or [5], Theorem 2.1) to the augmented state dynamics (p, x, §) to derive the first 
order adjoint equation: 
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dp{t) = - I + E[H^Jt)] + mE[fm{t)pi{t)] j dt + q{t)dB„ 

/(0p(r))-i\ (x{T)\ 

p(T) = I hAT) j - 0 I p(T) j 



This is a system of linear backward SDEs of mean-field type which, in view of 
([6], Theorem 3.1), under Assumption 1, admits a unique F-adapted solution (p, q) 
satisfying 



where, | • | denotes the usual Euclidean norm with appropriate dimension. 

We may apply the SMP for SDEs of mean-field type control from ([1], Theorem 
3.1 or [5], Theorem 2.1) together with the SMP for risk-neutral partially observable 
SDEs derived in ([23], Theorem 2.1) to obtain the following SMP. 

Proposition 1 Let Assumption 1 hold. If is an optimal solution of the 

risk-neutral control problem (17) subject to the dynamics (18), then there is a unique 
pair of V -adapted processes (p,q) which satisfies (21)-(22) such that 



E[W{t, R(t), pit), q(t), u) - HAt, R(t), pit), qit), M(r))|#-/] < 0, 



for all u e U, almost every t and V— almost surely. 



3.2 Transformation of the First Order Adjoint Process 

Although the result of Proposition 1 is a good SMP for the risk- sensitive mean-field 
type control with partial observations, augmenting the state process with the third 
component § yields a system of three adjoint equations that appears complicated to 
solve in concrete situations. In this section we apply the transformation of the adjoint 
processes {p,q) introduced in [11] in such a way to get rid of the third component 
(P3, ^31 , ^ 32 ) in (21) and express the SMP in terms of only two adjoint process that 
we denote {p,q), where 




( 22 ) 
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Indeed, noting that from (21), we have dp^it) = {q 3 (t),dBt) and P 3 (T) = —0x1/^, 
the explicit solution of this backward SDE is 

P3(l) = I #•,] = -6v\t), (24) 



where, 

v\t) := \ 0<t<T. 

In particular, we have (0) = E[xlf^]. Therefore, in view of (24), it would be natural 
to choose a transformation of (p, q) into an adjoint process (p, q) , where. 



such that 




/ ^11 4i2\ 

I ^21 ^22 I , 

\?31 ?32 / 



P3(0 = 



P3{t) 

9v^{t) 



= - 1 , 



0<t <T. 



This would imply that, for almost every 0 < t < T, 



hit) = (^3i(0, mit)) = 0, p - a.s.. 



(25) 



(26) 



which in turn would reduce the number of adjoint processes to those of the form 
given by (23). 

We consider the following transform: 



Ht) 



1 

9vHt) 



pit). 



0<t <T. 



In view of (21), we have 



piT) = - 



(iepiT))-^ 
( hAT) 



1 . 

p{T) ^E[f^jhmiT)]. 

\ 0 I n 



(27) 



(28) 



We should identify the processes a and q such that 



dp{t) — —a(t)dt + q(t)dBf, 



(29) 



for which (25) and (26) are satisfied. 
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In order to investigate the properties of these new processes (p, ^), the following 
properties of the generic martingale , used in [1 1] , are essential. We reproduce them 
here for the sake of completeness. Since, by Assumption 1, / and h are bounded by 
some constant C > 0, we have 

Therefore, is a uniformly integrable F-martingale satisfying 

0 < < v^r) < 0<t <T. 

Hence, in view of (2), we have 

E[ sup |v^(r)|2] < C. (30) 

o<t<r 



Furthermore, the martingale enjoys the following useful logarithmic transform 
established in ([12], Proposition 3.1) 



v^(r) = exp 0 J f{s, x(s), E[p(s)x(s)], u(s))ds^ , 0<t<T, (31) 

and 

v^(0) = E[\l/^] = exp(^Zo). 

Moreover, the process Z is the first component of the F-adapted pair of processes 
(Z, 1) which is the unique solution to the following quadratic BSDE: 



dZt = —{f{t, x{t), £'[p(5')x(5’)], u{t)) + ^\l{t)\^}dt + {i{t), dBt), 
Zr = pn p(T) +h(xT, E[p(T)^T)]). 
where, l(t) = (ii(t), 12(0) satisfies 

r pT 1 



\£{t)fdt 



< 00 . 



Lio 



(32) 



(33) 



In particular, v® solves the following linear backward SDE 



dv\t) = ev^{t){l(l), dB,), v\T) = 



(34) 
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Hence, 

Proof of Lemma 1. In view of (30), 

J‘\i(s)\^dsy.= Ll 0<t<T, (35) 

is a uniformly integrable F-martingale. □ 

To identify the processes a and q such that 

dp{t) = —a{t)dt -\-q(t)dBt, 

we may apply Ito’s formula to the process p(t) = Ov^ p(t), use (21) and (34) and 
identify the coefficients. We obtain 



a(t) = 



1 



/if;(0 + E[H^(t)]+x(t)E[U(t)p 3 (t)]\ 
( H^(t) + E[H^(t)] + mE[U(t)p3(t)] j 






+ 0q(t)i(t) 



(36) 



Therefore, 



+ E[W^{t)]^mEUm(t)p3{t)] 

= -eh) + E[H^Jt)] + mEUADPiit)] 

\ 0 

dvHt) = evHt){i(t),dBt), 

/(0p(r))->\ /x(T)\ 

p(T) = - hAT) - p(T) j^E[ir^jhAT)l 
vHT) 



dt + q{t)dBf , 



(37) 



where, Bf := Bt — 0 i(s)ds, 0 < t < T, which is, in view of (35) and Girsanov’s 
Theorem, a -Brownian motion, where 






In particular. 



<7^3(0 = (^3(0, -Ol(t)dt + dBt), phT) = -1. 
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Therefore, noting that p^it) \= [6v^ (t)] VsCO is square-integrable, we obtain 



Thus, its quadratic variation becomes \q 2 ,{t)\^dt = 0, — a.s. This implies 

that, for almost every 0 <t <T ,q 2 ,{t) = 0, and P — a.s. 

Hence, we can drop the last components from the adjoint processes {p,q) and only 
consider (keeping the same notation) 

\P2j \q 2 \q 22 ) 

for which (37) reduces to the risk-sensitive adjoint equation: 



dp{t) = - 



0v»(r) 



( 






H^it) + E[H^{t)] - p(f)£[/«(0] 

dvHt) = 



) 



dt + q{t)dBf 



(38) 



> \ hAT) 

vHt) = 



-1 



) \~piT))f\ 



-E[\l/ihr} 



(T)], 



In view of the uniqueness of F-adapted pairs (p, q), solution of (21), and the pair 
(v^, 1) obtained satisfying (33) and (34), the solution of the system of backward 
SDEs (38) is unique and satisfies (15). 



3.3 Risk-Sensitive Stochastic Maximum Principle 

We may use the transform (27) and (36) to obtain the explicit form (11) of the 
risk- sensitive Hamiltonian defined by 

Z(f), p{t), q{t). At), u) := m. Pit), qit), u), (39) 

where, W is defined by (19). 

Let 

6H\t) := R{t), p{t), q{t), u) - H^(t, R(t), p(t), q{t), u{t)) 
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and 

= H\t, X{t), pit), q(t), lit), u) - H\t, Z(f), pit), qit), lit), uit)). 

We have 

=0E[v^it)8H^it)\,^l] =0v^iO)E^[SH^it)\,^l], 
where, we recall that v^(f)/v^(0) = Lf = . 

Now, since 0 > Oand v^O) = E[if^] > 0, the variational inequality (1) translates 
into 

p(t), x{t), pit), qit), lit), u) - H\t, pit), xit), pit), qit), lit), uit))\^J] < 0. 

for all u e U, almost every t and P^— almost surely. This finishes the proof of 
Theorem 1. 



4 Illustrative Example: Linear-Quadratic Risk-Sensitive 
Model Under Partial Observation 



To illustrate our approach, we consider a one-dimensional linear diffusion with expo- 
nential quadratic cost functional. Perhaps, the easiest example of a linear-quadratic 
(LQ) risk- sensitive control problem with mean-field coupling is 

• f J7U 0\yj uHDdt+^x^m+fiE^lxiT)]] 

L2-«) J, 

subject to 

dxit) = iaxit) -h bu\t))dt -\- a dWt + adW^ , 
dYt = Pxit)dt +dW}^, 
v(0) = VO, To = 0, 

where, a, b, a, pi and cr are real constants. 

In this section we will illustrate our approach by only considering the LQ risk- 
sensitive control under partial observation without the mean-field coupling i.e. (/x = 
0) so that our result can be compared with [8] where a similar example (in many 
dimensions) is studied using the Dynamic Programming Principle. The case pi ^ 0 
can treated in a similar fashion (cf. [11]). 
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We consider the linear-quadratic risk- sensitive control problem: 



. ^ e\\ u^{t)dt+\x'^{T)\ 

inf„(.)g<2^ £“e L2 ■'0 2 J, 

subject to 

dx{t) = {ax{t) -h hu{t)) dt -\- a dWt + adW^ , 
dYt = Px{t)dt +dW}^, 
x(0 )=xo, Yo = 0 , 



(40) 



where, a, b, a, ^ and a are real constants. 

An admissible process (p(-), -^(O. ^(0) satisfying the necessary optimality condi- 
tions of Theorem 1 is obtained by solving the following system of forward-backward 
SDEs (cf. (5) and (14)) (see Remark 1, above). 



where, 



dp(t) = /3p(t)x(t)dYt, 

dx(t) = {cx(t) -h bu(t)} dt -h cfdWt + otdYt, 



dp(t) = - 






dt + q(t){—6l{t)dt + dBt), 



-1 



dvHt) = evHt){l{t),dB,), 
P(T) 

> \ x(D 

vHT) = 



(41) 



)■ 



P(0) 



1, x(0) = xo, 






and the associated risk-sensitive Hamiltonian is 



{t, p, V, u, p, q, i) := (cx + bu)p 2 — \u^ + p^x{q\\ + 0i\p\) 
+Q'(^21 + 0l2P\) + Cf{q22 + 0l2P2)- 



(42) 



In general the solution (v^ , i) primarily gives the correct form of the process i which 
may be a function of the optimal control u. Inserting i in the BSDE satisfied by 
(p, ^) in the system (41) and solving for (p, ^), we arrive at the characterization the 
optimal control of our problem. 
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For the LQ-control problem it turns out that by considering the BSDE satisfied by 
(v^, 1), we will find an explicit form of the optimal control u. Indeed, by (31), this 
is equivalent to consider the BSDE satisfied by (Z, f): 

' dZt = + I |£(r)PWr + dB,), 

Zt = I Inp(r) + \xj. 

Since u is , the form of Zj suggests that we characterize u and £ such that 

E^[Zt\^J] = 1 lnp(0 + 0 < r < r, 

z u 

where, y and rj are deterministic functions such that y(T) = 1 and r](T) = 0. In 
view of the SDEs satisfied by (p, x) in (41), applying Ito’s formula and identifying 
the coefficients, we get 

hit) = (ay(t) + P/0)x(t), hit) = ay(t)xit) (43) 



and 

E^[^ [y(t) + 2(c + + {6{a^ + a^) - b^)y^(t)) x^{t)\^l] 

+E^[fl(t) + i((r2 + a^)y{t) + (m(0 + by{t)x{t))^\^^] = 0. 



Hence, 

; y{t) + 2{c + aP)y(t) + + a^) - b^)y^{t) = 0, y(T) = 1, 

flit) + +oc^)y{t) =0, rj(T) = 0, 

where, the first equation is the risk- sensitive Riccati equation, and 

E'^mt) + by{t)x{t)f\^l] = 0. 



By the conditional Jensen’s inequality, we have 



E^[Uit)+byit)x(t)\.^l^] 



<E^mt) + by{t)x(t)f\^^]. 



Therefore, the optimal control is 

U(t) = -by(t)E^[x{t)\^ll (45) 

and the optimal dynamics solves the linear SDE 

dx(t) = (cx{t) — y (t) [x dt -h adWt + oidYt, x(0) = xq, (46) 
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where, by the filter equation of Theorem 8.1 in [22], 7tt{x) := is the 

solution of the SDE on (f2, F, P^): 

7Tt{x) =XQ + j {c - b^Y{s))7Ts{x)ds -\- J + (6ay(t) + y6) df^ , 

where, Tf = Yt~ jQ(0ay(s) + /3)7ts(x)ds is an (f2, F^, P^)-Brownian motion. 

Inserting the form (43) of I in the BSDE satisfied by (/?, ^) in the system (41) and 
solving for (/?, ^), we arrive at the same characterization the optimal control of our 
problem, obtained as a maximizer of the associated given by (42). We sketch the 
main steps and omit the details. 

We have 



Hu=bp2-u, Hp = ^x{qu +0lipi), = cp2 + ^p(qn + Olipi). 

The BSDE satisfied by (p, q) then reads 

dp\{t) = - {qn{t){fix{t) + 6i\{t)) + d{i\{t)p\{t)x{t) + qn{t)ti{t))] dt 
+ qn{t)dYt +qi 2 it)dWt, 

dp2(t) = - [cp 2 {t) + pp(t)(qn(t) + 0li{t)pi(t))} dt 

+ + q22^2{t))dt + q2\{t)dYt + q22(t)dWt, 

P2(T) = -x(T). 

(47) 



In view of Theorem 1, if w is an optimal control of the system (40), it is necessary 
that 

E^[bp2it)-d{t)\.^^] = 0. 

This yields 

m = bE^[p2it)\^l]. 

The associated state dynamics x solves then the SDE 

dx(t) = |cv(0 + dt + crdWt + adYt. 



It remains to compute E^ [p 2 (t)\^^ ]. Indeed, inserting the form (43) of I in the 
BSDE satisfied by (p,q) in the system (47), by Ito’s formula and identifying the 
coefficients, it is easy to check that (pi(t), quit), quit)) given by 



Pi(t) := 



1 

w’ 



^ii(0 



P x{t) 
0 



quit) \= 0 



solves the first adjoint equation in (47). Eurthermore, since P 2 (T) = —x(T), setting 
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where, X is a. deterministic function such that X(T) = 1, and identifying the coeffi- 
cients, we find that X satisfies the risk-sensitive Riccati equation in (44). Moreover, 

^21 (0 = -crHO, q 22 (t) = -aX(t). 

By uniqueness of the solution of the risk-sensitive Riccati equation in (44), it follows 
that X = y. Therefore, 

E^[p2{t)\^^] = -Y(t)E^[m\-^^l q 2 lit) = -crY{t). ^22(0 = -«K(0- 

Summing up: the optimal control of the LQ-problem (41) is 

M(r) = -bY(t)E^[x{t)\^^l (48) 

where, y solves the risk-sensitive Riccati equation 

y(t) -h 2(c -h a/3)y(t) -h (0(cr^ -h a^) - b^)y^(t) = 0, y(T) = 1. (49) 

The optimal dynamics solves the linear SDE 

dx(t) = {cx{t) — b^y(t)E^{x(t)\^J]^ dt -h adWt + otdYt, x(0) = vq, (50) 
and the filter (x) := E^\x{t)\^J ] is solution of the SDE on (f2, F, P^): 

TTtix) = xo-\- (c - b^y{s))7Ts(x)ds (a (Oay{t) -h b) - 7T^^(x)j^ dY^ , 

where, Y^ = Yf — f^ibayis) -h jS)7ts(x)ds is an (f2, F^, P^)-Brownian motion. 

Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 
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Abstract According to theoretical arguments, a properly designed emission trad- 
ing system should help reaching pollution reduction at low social burden based on 
the theoretical work of environmental economists, cap-and- trade systems are put 
into operations all over the world. However, the practice from emissions trading 
yields a real stress test for the underlying theory and reveals a number of its weak 
points. This paper aims to fill the gap between general welfare concepts underly- 
ing understanding of liberalized market and specific issues of real-world emission 
market operation. In our work, we present a novel technique to analyze emission 
market equilibrium in order to address diverse questions in the setting of risk-averse 
market players. Our contribution significantly upgrades all existing models in this 
field, which neglect risk-aversion aspects at the cost of having a wide range of sin- 
gularities in their conclusions, now resolved in our approach. Furthermore, we show 
both how the architecture of an environmental market can be optimized under the 
realistic assumption of risk-aversion. 
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1 Practice of the EU ETS 

A properly designed emission trading system should help reducing pollution reduc- 
tion with low social burden. 

In this paper we understand it as a burden to the society, caused by energy pro- 
duction. We assume that it can be measured in monetary units including both, the 
overall production costs and an appropriately quantified environmental impact of 
energy production. 

Originated from this idea, and based on the theoretical work of environmental 
economists, cap-and-trade systems have been put into operations all over the world. 

The problem of design optimization for emission trading schemes has been 
addressed in [4]. This work shows that, in general, a traditional architecture of 
environmental markets is far from being optimal, meaning that appropriate alter- 
ations may provide significant improvements in emission reduction performances at 
lower social burden. Such improvements can be achieved by extending a regulatory 
framework, which we address below as extended scheme. 

Let us explain this. 

In the traditional scheme, it is assumed that the administrator allocates a pre- 
determined allowance number to the market and sets a compliance date at which a 
penalty must be paid for each unit of pollutant not covered by allowances. Hence, 
the policy maker can exercise merely two controls, the so-called cap (total amount of 
allowances allocated to the market) and the penalty size. In theory, a desired pollution 
reduction can be reached at some costs for the society by an appropriate choice of 
these parameters. However, in practice, there is not much flexibility, since the cap 
is motivated politically and the penalty is determined to provide enough incentives 
for the required pollution reduction. As a result, the performance of the traditional 
scheme could be very poor in terms of social burden for the achieved reduction. 

In an extended scheme, the policy maker has much more influence. The regulator 
can tax or subsidize the production in terms of monetary units or in terms of emission 
certificates. These additional controls can be implemented in a technology-sensitive 
way. Doing so, the merit order of technologies can be changed significantly. On this 
account, emission savings, triggered by certificate prices, also become controllable. 
The work [4] illustrates that, by an appropriate choice of additional controls, the 
market can reach a targeted pollution reduction at much lower social burden. 

Although these theoretical findings are sound, intuitive, and practically important, 
the optimization of environmental market architectures could not be brought to a level 
suitable for practical implementation. There are two reasons for this. 

(1) The existing approach [4] is based on the unrealistic assumption that each of 
the market players is non-risk-averse in the sense that it realizes a linear utility 
function. This assumption is not conform with the modem view and creates 
a number of singularities in the model. A priori, it is not even clear which 
conclusions of this work do hold under risk- aversion. 
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(2) Although the practical advantage of such market design optimization is obvi- 
ous, policy makers hardly can use the theoretical findings of 14], because their 
quantitative assessment requires optimal control techniques whose numerics is 
difficult. 

In this work, we address both issues, namely: 

(h) We assume a non-linear utility function for market agents and show several 
properties of the market equilibrium which make market design optimization 
possible. With this, our model is brought in line with standard economic theory 
and is appropriate for further developments. We also emphasize that, to capture 
risk-aversion, a completely new argumentation has been developed. 

(20 We provide our study in a one-period setting. Being accessible without optimal 
control techniques, the results become evident and potentially usable for a broad 
audience, including practitioners and decision makers. 

The paper is organized as follows. Section 2 discusses the literature developed 
concerning markets of emission certificates. In Sect. 3 we introduce our equilibrium 
model. Section 4 deepens the analysis of the equilibrium. Sections studies the social 
optimality of the equilibrium and proves that it corresponds to the overall minimum- 
cost policy under a risk-neutral probability distribution. Section 6 discusses some 
perspectives of optimal market design. The final Sect. 7 provides conclusions. 



2 Theory of Marketable Pollution Rights 

The efficiency properties of environmental markets have been first addressed in 
16, 10], who first advocated the principle that the “environment” is a good that 
can not be “consumed” for free. In particular, Montgomery describes a system of 
tradable certificates issued by a public authority coupled with fixing a cap to the total 
emissions, and, doing so, to force polluting companies paying proportionally to the 
environmental damage generated by their production activity. An emission certificate 
is representative of the permission to emit a given quantity of pollutant without being 
penalized. Companies with low environmental impact can sell excess certificates and 
the resulting revenue represents a general incentive to reduce pollution. Montgomery 
shows that the equilibrium price for a certificate must be driven by the cost of the 
most virtuous company to abate its marginal unit of pollutant. The key result of his 
analysis is that such a system guarantees that the reduction of pollution is distributed 
among the companies efficiently, that is minimizing their total costs. 

After the seminal analysis of Montgomery, which is based on a deterministic and 
static model, the following research has taken the direction to the stochastic and 
multi-period settings. A literature review on the research which has developed after 
Montgomery’s work can be found in 1 14] . A common result shared by all the analyses 
developed so far is that cap-and-trade systems indeed represent the most efficient way 
to reduce and control the environmental damage generated by the industrial activity. 
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Let us mention the contributions which are directly related to our analysis. A 
majority of relatively recent papers [1-5, 11, 13] are related to equilibrium models, 
where risk-neutral individuals optimize the expected value of their profit or cost 
function. The hypothesis of risk-neutrality of the agents is gracefully assumed in 
those contributions, since it significantly simplifies the proof that environmental 
markets are efficient. Some papers have considered explicitly risk-averse decision 
makers. One of them, [9], develops a pricing model for the spot and derivative pricing 
of environmental certificates in a single-period economy. In [7], the authors also 
develop a (multi-period) equilibrium pricing model for contingent claims depending 
on environmental certificates, where risk-averse agents maximize the expected utility 
of their profit function. 



3 One-Period Equilibrium of Emission Market 

To explain the emission price mechanism, we present a market model where a finite 
number of agents, indexed by the set /, is confronted with abatement of pollution. 
The key assumptions are: 

• We consider a trading scheme in isolation, within a time horizon [0, T], without 
credit transfer from and to other markets. That is, unused emission allowances 
expire worthless. 

• There is no production strategy adjustment within the compliance period [0, T]. 
This means that the agents schedule their production plans for the entire period 
[0, T] at the beginning. Allowances can be traded twice: at time r = 0 at the 
beginning and at time t = T immediately before emission reports are surrendered 
to the regulator. 

• For the sake of simplicity, we set the interest rate to zero. 

• Each agent decides how much energy to produce and how many allowances to 
trade. 

Note that this one-period model is best suited for our needs to explain the core 
mechanism of market operation and to discuss its properties. A generalization to a 
multi-period framework is possible, but it gives no additional insights related to the 
goal of this work. 

The ith agent is specified by the set of feasible production plans for the 
generation of energy (electricity) within one time period from ^ = 0 to ^ = T. 
Further, we consider the following mappings, defined on for each agent i e I: 

with the interpretation that for production plan g S' ^ the values Vq (§q), Cq(§q), 
and £” 0 (^ 0 ) stand for the total production volume, the total production costs, and the 
total carbon dioxide emission, respectively. 
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Production: At time ^ = 0, each agent i G I faces the energy demand Do g M+ of 
the entire market, the realized electricity price Po G M+, and the emission allowance 
price Aq g Based on this information, each agent decides on its production plan 
G S\ where is the set of feasible production plans. Given g time T, 

agent realizes the total production costs, 

e M, (1) 

the production volume 

( 2 ) 

and the total revenue, V'o (§o)’ electricity sold. 

Allowance allocation: We assume that the administrator allocates a pre-determined 
number g [0, oo[ of allowances to each agent i. 

So far, we have introduced deterministic quantities. Let us now turn to uncertainties 
modeled by random variables on the probability space (^2, P). 

Emission from production: Following the production plan §q, the total pollution of 
agent i is expressed as £’^(§q). 

Remark (Randomness in demand and production) The question of randomness in 
energy demand and production deserves a careful argumentation. The reader may 
be confused by the assumption that in our one-period modeling, the time unit may 
correspond to the entire compliance period (which suggests a rather long time), such 
that our assumption on deterministic demand and unflexible production schedule 
appears unrealistic. To ease understanding, one shall imagine an artificial emission 
market model for short time period, say one day until compliance. The point of our 
proposal is that the elements, the arguments and the techniques required to define the 
optimal production plan on a daily basis are the same of those required to identify 
the plan over a generalized period [0, T]. The value of this toy model is that it 
allows a straight-forward generalization to the multi-period situation. In our one- 
period modeling, we assume that the nominal energy demand Do is non-random 
and is observed at the time r = 0 when production decisions are made. We also 
suppose that the production plan of each agent is deterministically scheduled at 
time t = 0. This view is in line with the current practice in energy business, where 
a nominal energy production volume along with a detailed schedule of production 
units is planed non-randomly in advance. Of course, the realized energy consump- 
tion deviates from what has been predicted. However, based on our experience in 
energy markets, it does not make sense to include this random factor into equilibrium 
modeling, since all decisions are made on the basis of a non-random demand antici- 
pation and non-random customer’s requests for energy delivery. To maintain energy 
consumption fluctuations in real-time, diverse auxiliary mechanisms are used. They 
can be considered as purely technical measures (security of supply by reserve mar- 
gins). For this reason, we believe that it is natural to assume that, although the energy 
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demand Dq is known and production plan is deterministically scheduled at time 
r = 0, the total emission, associated with this production can not be predicted with 
certainty at time r = 0 when the production and trading decisions are made. In fact, in 
practice the producers have to manage diverse source of randomness while following 
production, (demand fluctuation, outages of generators) which yields usually small 
but unpredictable deviations from the nominal emission associated with 

production plan §q. Thus, let us agree that £’j(§q) is modeled as a random variable 
given as a sum 

4(§^) = + N\ iel (3) 

with deterministic function 

describing the dependence of the nominal emission on the production plan and a 
random variable standing for the deviation from the nominal emission. Note that 
the random emission £’j(§q) will be the only source of uncertainty in our model. 

To ease our analysis, let us agree on the natural assumption that for production 
schedules g St i g I the total market emission X/e/ possesses no point 

masses: 



P(^ =z)=0 for all z € M. (4) 

iel 

Allowance trading: At times t = the allowance permits can be exchanged 
between agents by trading at the prices Aq and At, respectively. Denote by ?7 q, 
the change at times ^ = 0, T of the allowance number held by agent iel. Such 
trading yields a revenue, which is 



-z^^Ao-4Ar. (5) 

Note that and Aq are deterministic, whereas and At are modeled as random 
variables. Observe that sales are described by negative values of z^q, therefore 
(5) is non-negative random variable, if permits are sold. 

Penalty payment: As mentioned above, the penalty tt g [0, -\-oc[ must be paid at 
maturity T for each unit of pollutant not covered by allowances. Given the changes 
at times r = 0, T due to allowance trading, i.e. z^q and z7^, the production §q, and 
the total number of allowances allocated to agent iel, the loss of agent i due to 
potential penalty payment is given by 

- 4 - 4 - y^)+. (6) 

Individual profit: In view of (l)-(6), the profit of agent i e I following trading and 
production strategy (z7^ , depends on the market prices {A, P) = 
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(Aq, At, Pq) for allowances and energy and is given by 

Ao - 4 At - 4(4) + PoV44) 

-n(E44)-4-4-Y‘)+. 



Note that the individual profit could be negative. 

Risk-aversion and rational behavior. Suppose that the risk attitudes of each agent 
i G / are described by a pre-specified strictly increasing utility function : M ^ M. 

With this, the rational behavior of the agent i is targeted on the maximization of the 
functional 



over all the possible trading and production strategies = (z7q, §q). 

Energy demand: Suppose that at time ^ = 0 all agents observe the total energy 
demand, which is described by Dq g Let us agree that the demand must be 
covered. 

Market equilibrium: Following standard apprehension, a realistic market state is 
described by the so-called equilibrium — a situation where all allowance prices, all 
allowance positions, and all production decisions are such that each agent is satisfied 
by its own policy and, at the same time, natural restrictions are fulfilled. 

Definition 1 Given energy demand Dq g M+, the prices (A*, F*) = (Aq, A^, 
Pq) are called equilibrium prices, if, for each agent i e I, there exists a strategy 
^ 

(i) the energy demand is covered 

Y,vl(4*) = Do, 

iel 

(ii) the emission certificates are in zero net supply 

^ df* = 0 almost surely fovt = Omdt = T, (7) 

iel 

(iii) each agent / G / is satisfied by its own policy in the sense that 

E(u‘(r*’^*'‘(&‘*,4))) >E(u‘(r*'^*’‘(&‘,^‘))) ( 8 ) 

holds for any alternative strategy 

The main objective of this section is to prove that in the present model the electricity 
price formation is determined by the usual merit order arguments, where the effect of 
emission regulation causes emission allowance prices to enter the costs of production 
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at the specific emission rate. This issue can be considered as the core mechanism of 
any cap-and-trade system, since including pollution costs into final product prices 
causes a change of the merit order of production technologies towards a cleaner 
production. To formulate this result, let us elaborate on the opportunity costs and 
introduce additional definitions. 

Opportunity costs: In the economic literature, they stand for the forgone benefit 
from using a certain strategy compared to the next best alternative. For example, the 
opportunity costs of farming own land is the amount which could be obtained by 
renting the land to someone else. Let us explain how the opportunity costs necessarily 
lead to windfall profits. 

When facing energy (electricity) generation, producers consider a profit, which 
could be potentially realized when, instead of production, unused emission allowances 
were sold to the market. For instance, if the price of the emission certificates is 12 
€ per tonne of C02 and the production of one Megawatt-hour (MWh) emits two 
tonnes of C02 (say, using a coal-fired steam turbine), then the producer must decide 
between two strategies which are equivalent in terms of their emission certificate 
balance: 

• produce and sell one MWh to the market, 

• do not produce this MWh and sell allowances covering two tonnes of C02. 

In this situation, the opportunity costs of producing one MWh are 2 x 12 = 24 €. 
Obviously, the agent produces energy only if the first strategy is at least as profitable 
as the second one. Thereby, both the production and the opportunity costs must be 
considered in the formation of the electricity market price. Clearly, if the production 
costs of electricity are 30 € per MWh, then the energy will be produced only if its 
price covers both the production and the opportunity costs. Thus electricity can only 
be delivered at a price exceeding 30-h2xl2 = 54€. That is, in order to trigger the 
electricity production, the opportunity costs must be added to the production costs. 

In the scientific community, this phenomenon is well-known under the name of 
cost-pass-through. An empirical analysis, see [12] confirms that the strategy of cost- 
pass -through is currently followed by the European energy producers. Furthermore, 
the detailed investigation of mathematical market models shows that the cost-pass- 
through is the only possible strategy in the so-called equilibrium state of the market. 
This can be interpreted as follows: when behaving optimally, the energy producers 
must pass the allowance price on to the consumers. Note that the producer obtains a 
windfall profit of 24 € in anycase: if electricity price is higher than 54, by passing- 
through the price of the certificates (that he has received for free); if the price is less 
than 54, by selling 2 certificates at 12 € on the emission market. 

More importantly, it turns out that the cost-pass-through is nothing but the core 
mechanism responsible for the emission savings. Namely, due to the opportunity 
costs, clean technologies appear cheaper than emission-intense production strategies. 
For instance, the alternative generation technology represented by a gas turbine, 
which yields energy at the price of 40 € and emits only one tonne of C02, hardly 
competes with a coal-fired steam turbine under generic regime (without emissions 
regulation). Namely, if there is no regulatory framework, then the coal-fired steam 
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turbine is scheduled first and the gas turbine has to wait until the energy demand 
can not be covered by coal-fired steam technologies. However, given an emission 
regulation, the opposite is true: say, if the allowance price is equal to 12 € per tonne 
of C02 as above, then the gas technology appears cheaper, operating at full costs 
of 40 + 1 X 12 = 52 €. Thus, the gas turbine is scheduled first, followed by the 
coal-fired steam turbine, which runs only if the installed gas turbine capacity does 
not cover the energy demand. 

In the next section, we will show that the only rational behavior in equilibrium 
is to pass the opportunity costs on to the consumers. For this, we require additional 
notions. 

Definition 2 Consider a given energy amount d g M+ and a given allowance price 

a e M+. 

(i) Introduce the individual opportunity merit order costs of agent / G / as 

a) = inf{C^(§^) + G > d]. 

An individual production plan g is called conform with opportunity costs at 
emission price a G M+ if 

that is to say is confirm if it minimizes the production and emission costs among 
all the alternative plans offering the same generation and given emission price a. 

(ii) Introduce the cumulative opportunity merit order costs as 

a) = inf{^(4(t^) + e E\ i € I, > d}. 

iel iel 



The production plans g S\ i g /, are called conform with opportunity costs at 
emission price a G M+ if 

«) = Z (^o(^^) + «4(?(^)) • 

iel iel 

(iii) Any price p G M+ with the property that 

— ^(d, a) -\- pd < —^(d, a) -h pd for all J G M+ (9) 

is referred to as an opportunity merit order electricity price at (d, a) and it can be 
understand as the marginal cost for the entire generation sector given the level of 
demand d and emission price a. 
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4 Properties of Equilibrium 

With these definitions, we now show that, within any equilibrium, the production 
plans are always conform with opportunity costs. Furthermore, the equilibrium elec- 
tricity price is always an opportunity merit order price. 

Proposition 1 Given energy demand Dq, let (A*, P*) = (Aq, A^, Pq ) he the equi- 
librium prices with the corresponding strategies i e I , then the following 

points hold: 

(i) For each agent i G I, the individual production plan §q* is conform with oppor- 
tunity costs at emission price Aq.* 



(ii) The market production schedule §q*, i G I, is conform with opportunity costs at 
emission price Aq.* 



(Hi) Pq is an opportunity merit order price at (X/e/ ^0^* 

The direct economic consequence of this mathematical result is that each individual 
will organize its own production strategy by scheduling power production units in 
an increasing price order. Thereby their variable costs (which include the opportu- 
nity costs of using the emission certificates) are considered. Within such a schedule, 
a demand d is satisfied by gradually turning on the most economic plants, until a 
generation level matching d is reached. Furthermore, the above proposition states 
that such schedule is reached not only on the individual level, but also for the entire 
market. Namely, an overall demand d is satisfied by gradually turning on the most eco- 
nomic plants until reaching a production which covers the demand d. Such aggregate 
ordering is usually called merit order, for this reason we call (J, a) and ^(d, a) 
(agent’s i) opportunity merit order costs and cumulative opportunity merit order 
costs, respectively. 

It is worth noticing that the opportunity merit order electricity price as defined in 
(9) is equal to the marginal cost of generating electricity when the level of demand is 
d given certificate price a. Coupling this property with the merit order production in 
the electricity sector, implies that the most expensive production in the plan §q will 
determine the marginal cost at demand level d and emission price a. The opportunity 
merit order electricity price at (d, a) is defined as the lowest price, which is able to 
trigger the required production level. 

Proof (i) Consider the equilibrium strategy (?7 q* , §q*) of agent i G I. Assume that the 
agent deviates from this strategy following an alternative production plan §q g 
However, to keep the same emission credit balance, the difference £’q(§q) — £’q(§q*) 




( 10 ) 




( 11 ) 
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is traded at the market in addition to z7q*. That is, we change the equilibrium trading 
strategy (z^q*, to an alternative trading strategy (z^q, z7^) given by 



Note that we have changed only the initial position, from z^q* to z^q, whereas the 
final position is the same z^^ = z^^* . A direct calculation shows that the profit of this 
alternative strategy (z7^ = (z7q, z7^, §q) can be written as 






(^^r) 









i.e. 



it differs form the original profit by 



the amount 



Note that R(^q, §q*) ^^1 be positive, since otherwise 






would yield 



thus contradicting the optimality of the equilibrium strategy (z7^* , (see (8)). Now, 

from R(^q, §q*) <0 we conclude that 



- + ^0 ^(5(^^*) > -Chi^o) - + Po*^(5(^^) (12) 



for each g S\ With this, we conclude the desired assertion (10) as follows: 
any alternative production plan which produces an energy amount (§q) at least 
equal to Vq (§q*) must satisfy 



cWo) + < cWo) + 



Thus, 



C' = inf{C' (^') + AlE^ip^) : e S', 



(ii) Summing up (12) over / g /, yields, for arbitrary choices of g S\i g /, 
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- z + K z ^0(^0*) 

iel iel 

> - Z (<^o(^^) + + Po z ^0 (^^)- ( 13 ) 

iel iel 



From this, we conclude that, for any choice g S\ i e /, of production plans 
satisfying 



Zl^o(^o)>Zl"o(^^’ 

iel iel 



it holds 



Z + AoEk^k)) < Z ^ 

iel iel 

implying the desired assertion (11). 

(iii) We need to prove that, for any d e M+, 

-nd, A*o) + P^d < A*o) + Po* Z l"o(^o*)- 

iel iel 

For each choice of production strategies g S\ i g /, estimate (13), combined 
with (11), gives 



-<T(Z yi> K) + Po* Z ^0 

iel iel 

> - Z (<^o(^o) + ^S4(?o)) + ^0 Z 1^0 (^o)- 

iel iel 

In particular, if the strategies are chosen from 

e E‘, iel, ^ > d}, 

iel 



then it holds that 

-^(Z 1^0 (^0*). A*o) + Po* Z 1^0 (^0*) > - z (<^o(^^) + A*oEk^k) + Pod- 

iel iel iel 

Passing on the right-hand side of this inequality to 

nd, A*q) := inf{^(C' (!') + A*oEk^k) ■ ?0 ^ S', i ^ I, Z > d}, 

iel iel 
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yields the desired assertion 

AS) + Po* Z AS) + P^d. 

iel iel 

Remark The statement (ii) of the above proposition characterizes equilibrium in 
terms of aggregated quantities. Once the equilibrium is reached, the production 
schedule represents the cheapest way to satisfy the demand. From this perspective, 
the reader may conclude that the equilibrium production schedule can be obtained 
as a production plan which minimizes the overall costs among those which cover 
a given demand, indicating that only aggregated quantities do influence the equi- 
librium. However, we shall emphasize that the equilibrium still heavily depends on 
individual ingredients (such as initial endowments and risk aversion), which enter 
through the initial allowance price. 

Now, we show another natural property of the equilibrium allowance prices. It 
turns out that there is no arbitrage allowance trading and that the terminal allowance 
price is digital. 

Proposition 2 Given energy demand Dq, let (A*, P*) = (Aq, A^, Pq ) he the equi- 
librium prices with the corresponding strategies (?^^*, §^*), i G It holds: 

(i) There exists a risk-neutral measure Q* P such that A* = (Aq, A^) follows a 
martingale with respect to Q*. 

(ii) The terminal allowance price in equilibrium is digital 

A*T = 4«o*)-Ko>0)- (14) 

Proof (i) According to the first fundamental theorem of asset pricing, see [8] in 
discrete-time setting, the existence of the so-called equivalent martingale measure 
satisfying Aq = E^*(Ar) is ensured by the absence of arbitrage. Fortunately, in 
our framework, the absence of arbitrage follows from the equilibrium notion, as we 
show next. We thus conclude (i) of the above theorem and it remains to verify that the 
equilibrium rules out all arbitrage opportunities for allowance trading. Let us follow 
an indirect proof, assuming that vq is an arbitrage allowance trading, meaning that 

P(vo(A* - AS) > 0) = 1, P(vo(A^ - AS) > 0) > 0. (15) 

Based on this we obtain a contradiction by showing that each agent i can change its 
own policy (z^^*, §^*) to an improved strategy (^S §^*) satisfying 

e(u‘ <E(u‘{L^*’\¥,^‘*))j . (16) 

The improvement is achieved by incorporating arbitrage vq into the allowance trading 
of each agent i as follows: 

^S := dl; + VO, 4 •= 4* - ^0. 
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Indeed, the revenue improvement from allowance trading is 

- ¥jA*j = -¥qAI - ¥jA*j + vo(A*t - AS), 
which we combine with (15) to see that 

p ^‘*) < L '^’‘ = 1, p f‘) < > 0, 

which implies (16), therefore contradicting the optimality of §^*). 

(ii) From equilibrium property (8), it follows that for almost each a; G ^2 the terminal 
allowance position adjustment §t (<^) is a maximizer on M to 

z ^ -zA*j{co) - ;r(£^(tS*)(a>) - - yI ~ zt ■ (H) 

Note that a maximizer of this mapping exists only if 0 < Aj{co) < tt. That is, the 
terminal allowance price in equilibrium must be within the interval Aj g [0, tt] 
almost surely. Let us show now that the price actually attains only boundary values 
almost surely, i.e. 



A \ G {0, tt} almost surely. (18) 

Suppose that an intermediate value Aj { a )) g]0, 7r[ is taken, then the unique 
maximizer of function (17) is attained on Ej{^l^){co) — z^q* — This implies that 
= £'^(§q*)((z>) — z^Q* — ]/q holds for each i g /, and a summation over i 

yields 

^ ¥*{w) = ^ (4(|^*)(o>) - 4 - K(S) = Z 4(4)(«) - Ko- 

iel iel iel 

Note that equilibrium property (7) ensures that the random variable on the left-hand 
side of the above equality is zero almost surely. Thus, the inclusion 

{CO :A* e]0, :r[} c {w : ^ 4(4) - yo = 0} (19) 

iel 

holds almost surely. Because of (4), the probability of the event on the right-hand 
side of the above inclusion is zero, which shows (18). 

If Aj{(jo) = 0, then a maximizer z7^*(az) to the function (17) is attained on 
[£^(§^*)(o;) - z^^* - 7(5, -hoo[. Hence 

{a; :A* = 0} C {a; :4(§^*) - z^^* - < z^[*} 



holds almost surely for each / g /, which implies that 
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[a> = 0} c ^ 4(|'*) - yo < X 

iel iel 

holds almost surely. Now, because of the equilibrium property (7), we obtain the 
almost sure inclusion 

{ft) = 0} c {ft, : ^ £^(4) - yo < 0}. 

iel 

Since the probability of g]0, tt 1 is zero (19), we conclude for the complementary 

event that 

{ft, ;A* = ;r) 3 {ft, : - yo > 0) (20) 

iel 

holds almost surely. Let us show the opposite inclusion. If Aj{cd) = tt, then a 
maximizer to function (17) is attained on ] — oo, £’^(§q*)(o;) — z7q* — /q]. 

Hence, 



{u; :A* = TT} c (4(§^*) - ^ > 4*} 

holds almost surely for each / g /, which implies that 

{ft, : A* 7^ = :r} c {ft, ; ^ 4(4) - yo > ^ 4*1 

holds almost surely. Now, because of the equilibrium property (7), we obtain 

{ft,:A* =:r}c{ft,:;^4(4)-yo>0}. (21) 

iel 

Finally, combine inclusions (20) and (21) to obtain assertion (14). 



5 Social Optimality 

To formulate social optimality, we require additional notations. Given production 
strategies g E\ i g /, we denote the overall market production schedule by 
= (^q)/^/ and introduce the total production costs Cq, the total production volume 
Vo, and the total carbon dioxide emission Ej and the total nominal carbon dioxide 
emission, defined by 



Co(?o) = Xc'o(?o)- = £r(§o) = X4(?o)^ £o(?o) = X4(?o)- 

iel iel iel iel 
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Having in mind that Co(§o) stands for the overall costs of the production and inter- 
preting 7r(£’r(§o) — Ko)^ as a proxy of the environmental impact of the production 
schedule let us agree that 

B(?o) = Co(lo) + 7r(£r(lo) - Ko)+ 

expresses the social burden caused by the overall production plan ^ 

It turns out that the equilibrium strategy minimizes the social burden among all 
production strategies which cover a given demand. 

Proposition 3 Given energy demand Dq, let (A*, P*) = (Aq, A^, Pq) he the equi- 
librium prices with the corresponding strategies i G I. Let Q* be a 

risk-neutral measure whose existence is shown in Proposition 2. Then 

(5(^0)) (22) 

holds for each production schedule ^ which yields at least the 

same production volume, Vb(§o) > = ^o- 

Proof For each convex function / : M ^ M, v /(v), it holds f{x) -\-V f{x)h < 
f{x-\-h),h G M, where V/(v) stands for one of the sub-gradients of / at the 
point X. In particular, for convex function / : M ^ M+, x x+, we obtain 
x+ -h l{;c>0}^ < (x -h for all X, /z g M. With the equilibrium production strategy 
= (§Q*)/g/, we conclude that 

(£r(?o) - Ko)+ + l(£H?o*)-K) 50 )(£r(^o) - £r(^o» < (^r(?o) - Ko)+ 

holds almost surely for any production strategy ^ Using our model 

assumption (3) we deduce £’r(§o) — = £’o(§o) — ^o(§o) which gives 

(£r(^o) - Ko)+ + l{£H?o*)-yo>o)(-Eo(^o) - £ 0 (^ 0 )) < (^r(^o) - Ko)+- 

Calculating the expectations with respect to Q* on both sides and multiplying both 
sides by 7T , we obtain 

ttE^* ((£r(^0*) - Ko)+) (l{£r(?o*)-Ko>0)) (-^ 0 (^ 0 ) - £o(^o» ^ 

< {(Et(^o) - YoV) . 

Using the martingale property and the digital terminal value of the equilibrium 
allowance prices shown in Proposition 2, we finally obtain 

:tEQ* ((Er(?o) “ Ko)+) + A^CEoC^o) - £o(^o» < (C^rC^o) - Ko)+) • 

(23) 
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For the case that the strategy yields at least the total production volume of the 
equilibrium strategy, Vb(§o) > Vo(§q), assertion (11) in Proposition 1 yields the 
estimate 



Co(§o*) + ^S^o(§o) < ^o(?o) + AS^o(§o), 

which is equivalent to 

Co(§o) - ^o(?o) < A^(Eom - ^o(?o*))- 

Now, combining the last inequality with (23), we obtain 

Co(^o*) + ((-Er(^o) - K 0 )+) < Co(^o) + {(Et(Io) ~ Ko)+) , 

which proves our claim (22). 

In Proposition 3, the equilibrium production schedule was characterized as a 
solution to the minimization problem 

min{E«2*(B(|o)) : to e Vo(lo) > -Do}- (24) 

Although this fact is about minimization of social burden, it should not be interpreted 
as one of the classical welfare results, which typically follow from equilibrium con- 
siderations. 

An interesting point here is that this type of cost-optimality needs to be taken with 
great care: due to the opportunity cost-pass-through, the consumers can not expect 
that an (inappropriately designed) cap-and-trade mechanism indeed implements the 
cheapest way of emission reduction, from their perspective. 

To see this point, remember that the price per unit of electricity under the merit 
order system includes the opportunity costs of consuming the emission certificates. 
Therefore, given emission price Aq and an overall production schedule the con- 
sumers pay the costs X/e/ AoE^(^q) to switch in the merit order and to reduce 
emissions. From a global perspective, this costs stands for a wealth re-distribution. 
From the consumer’s perspective, it is associated with a burden. 



6 Equilibrium-Like Risk-Neutral Modeling 



Another interesting observation from Proposition 3 is that the expectation 
of the social burden B(^o) is minimized with respect to a risk-neutral 
measure Q* which differs from the objective measure P. The measure Q* is an 
outcome of the equilibrium, and, as such, it heavily depends on the many model 
components, for instance on the risk- aversions, on the certificate endowments, and 
on the production technologies of the agents. However, it is surprising that, once the 
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measure Q* is known, other important equilibrium outcomes can be deduced from 
aggregated quantities only. 

In particular, given Q*, the equilibrium production schedule can be obtained as 
the solution of optimization problem (24). Such solution is determined by aggregated 
quantities, since the social burden is by definition 5(§o) = + ^(^r(§o) “ 

yo)^ and, apart the quantities yo and tt decided by the authority, it depends only 
on technologies present in the market. Having obtained the equilibrium production 
schedule as the solution of optimization problem (24), the equilibrium allowance 
price Aq is calculated applying martingale pricing: 



Aq = :tE'® (l{£j-(fo)-Ko>0))- 

Finally, given the production schedule and the allowance price Aq, also the elec- 
tricity price Pq is determined as the marginal price of the most expensive technology, 
which is active in the schedule . Note that the opportunity costs must be included 
when identifying the most expensive active technology. 

Summarizing, we conclude that given Q*, merely aggregated market parameters 
are needed to obtain , Aq, and Fq*. This observation can be used to establish and to 
analyze realistic equilibrium-like emission market models. Such models are needed, 
since in real emission trading it is nearly impossible to estimate the equilibrium from 
a market model, because the individual parameters are highly undetermined. For 
instance, within the EU ETS, there are more than 25, 000 agents, each with a specific 
production, its own certificate endowment, and a completely unknown risk-aversion. 
On the contrary, the aggregated quantities are well-known, since high-quality market 
data on total allowance allocation and electricity production, including capacities, 
costs, and emission rates, are available. 

In view of this, we suggest an alternative way to estimate the market equilib- 
rium based on aggregated quantities and using an exogenously specified proxy for 
risk-neutral measure Q* . This general approach follows the standard methodology 
of financial mathematics, which successfully describes the stochastic evolution of 
equilibrium prices on financial markets under an appropriately chosen risk-neutral 
measure. 



6.1 Market Equilibrium Under a Risk-Neutral Measure 



We sketch the following program for equilibrium-type modeling of emission markets: 

( 1 ) Determine a risk-neutral measure Q*, which corresponds to an equilibrium situ- 
ation of the emission market in the sense of (i) of Proposition 2. 

(2) Observe that, because of Proposition 3, the equilibrium production schedule 
must be a solution to the deterministic optimization problem 



Risk Aversion in Modeling of Cap-and-Trade Mechanism . . . 



283 



minimize Co (^o) + :tE'®*((£: 7 -(^o) - Ko)"';) . 25 ) 

subject to Vo (^ 0 ) > Dq, overdo e 

To address the problem further, a specification of the space of market pro- 

duction strategies along with the functions Cq, Vq and Et is required. 

(3) Given the equilibrium production schedule, calculate the total production costs 
C’o(§ 0 the total carbon dioxide emission and the energy price P* to assess 

the performance in emission reduction of the current market architecture. 

Remark Note that a risk neutral measure is not unique. Clearly, finding a realistic 
candidate for the risk-neutral measure Q* can be difficult. However, notice that one 
merely needs to specify the fluctuations of the non-predictable emissions under a 
risk-neutral measure. This distribution can be described in a parameter-dependent 
way, which adds flexibility to the model. For instance, having assumed a Gaussian 
framework under objective measure and modeling the density of the risk-neutral mea- 
sure in terms of a Girsanov kernel. Given theoretical initial emission price depending 
on the parameters of the Girsanov kernel, these parameters shall be adjusted to match 
the observed emission prices. Similar techniques have been applied in financial mod- 
eling under the framework of implicit model calibration. Being one of the central 
questions in quantitative finance, the connection between risk-neutral and objective 
measures has been successfully addressed over the recent decades. In view of this 
development, modeling from a risk-neutral measure perspective can be based on a 
variety of different methods, ranging from benchmark approach, estimation of risk 
premia, state price density from portfolio optimization theory, to several econometric 
methods for the estimation of the so-called market price of risk. 

Finally, the performance of the cap-and-trade mechanism can be examined leveraging 
on the dependence of the major economic indicators, i.e. total consumers’ costs 
PqDq, total (producers’) production costs Co(§q*)’ carbon dioxide emission 

£’o(§q*), on the controls available to the regulator. 

Note that in the standard scheme the regulator controls two key parameters: the 
total allowance allocation yo and the penalty size tt . The performance of regulation 
could be assessed in terms of relation between the increase of consumers’ costs versus 
the achieved emission reduction. Such analysis may uncover and visualize inappro- 
priate market architectures, where unlucky choices of yo and tt cause consumers 
to pay too much, compared to emission savings. Complementary or supplementary 
policies can be evaluated at this point next to the cap-and-trade system. In particular, 
different forms of subsidies and carbon tax mechanisms can have a strong impact on 
the merit order of different technologies. 



7 Conclusions 



In this paper, we show how equilibrium analysis and optimization of an environmental 
market can be carried out under the realistic assumption of risk-averse market players. 
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This generalization is based on a novel approach. Thereby, we obtain a number of 
interesting observations, which allow studying equilibrium market situations in terms 
of aggregated market quantities under a risk-neutral measure. Our findings show 
how market design optimization can be achieved incorporating risk-aversion. The 
choice to develop our approach in one-period setting, yields explicit results which 
constructively contribute to better understand the working principles of financial 
instruments and to improve both effectiveness and efficiency of environmental policy. 

Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 
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Exponential Ergodicity 

of the Jump-Diffusion CIR Process 



Peng Jin, Barbara Rudiger and Chiraz Trabelsi 



Abstract In this paper we study the jump-diffusion CIR process (shorted as JCIR), 
which is an extension of the classical CIR model. The jumps of the JCIR are intro- 
duced with the help of a pure-jump Levy process {Jt,t > 0). Under some suitable 
conditions on the Levy measure of {Jt,t > 0), we derive a lower bound for the tran- 
sition densities of the JCIR process. We also find some sufficient conditions under 
which the function V(x) = x, v > 0, is a Forster-Lyapunov function for the JCIR 
process. This allows us to prove that the JCIR process is exponentially ergodic. 

Keywords CIR model with jumps • Exponential ergodicity • Forster-Lyapunov 
functions • Stochastic differential equations 

MSC: 60H10 -60160 



1 Introduction 

The Cox-Ingersoll-Ross model (or CIR model) was introduced in [1] by Cox et al. 
in order to describe the random evolution of interest rates. The CIR model captures 
many features of the real world interest rates. In particular, the interest rate in the 
CIR model is non-negative and mean-reverting. Because of its vast applications in 
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mathematical finance, some extensions of the CIR model have been introduced and 
studied, see e.g. 12, 5, 15]. 

In this paper we study an extension of the CIR model including jumps, the so- 
called jump-diffusion CIR process (shorted as JCIR). The JCIR process is defined as 
the unique strong solution X := (Xt, ^ > 0) to the following stochastic differential 
equation 

dXt = a(e - Xt)dt + ay/Y,dWt + dJt, Zq > 0, (1) 

where < 2 , a > 0, 0 > 0 are constants, (Wt,t > 0) is a 1 -dimensional Brownian 
motion and (Jf,t > 0) is a pure-jump Levy process with its Levy measure v con- 
centrated on (0, oo) and satisfying 

[ (§ A l)v(J§) < 00, (2) 

J (0,oo) 

independent of the Brownian motion (Wt,t > 0). The initial value Xq is assumed 
to be independent of (Wf,t > 0) and (Jt,t > 0). We assume that all the above 
processes are defined on some filtered probability space P)- We 

remark that the existence and uniqueness of strong solutions to (1) are guaranteed 
by 17, Theorem 5.1]. 

The term a(0 — Xt) in (1) defines a mean reverting drift pulling the process 
towards its long-term value 0 with a speed of adjustment equal to a. Since the 
diffusion coefficient in the SDE (1) is degenerate at 0 and only positive jumps are 
allowed, the JCIR process (Xt ,t>0) stays non-negative if Xq > 0. This fact can be 
shown rigorously with the help of comparison theorems for SDEs, for more details 
we refer the readers to 17] . 

The JCIR defined in (1) includes the basic affine jump-diffusion (or BAJD) as a 
special case, in which the Levy process (Jt,t > 0) takes the form of a compound 
Poisson process with exponentially distributed jumps. The BAJD was introduced by 
Duffie and Garleanu 12] to describe the dynamics of default intensity. It was also 
used in 15, 12] as a short-rate model. Motivated by some applications in finance, the 
long-time behavior of the BAJD has been well studied. According to 112, Theorem 
3.16] and 110, Proposition 3.1], the BAJD possesses a unique invariant probability 
measure, whose distributional properties were later investigated in 19, 11]. We remark 
that the results in 110, 1 1] are very general and hold for a large class of affine process 
with state space R+, where R+ denotes the set of all non-negative real numbers. The 
existence and some approximations of the transition densities of the BAJD can be 
found in 16] . A closed formula of the transition densities of the BAJD was recently 
derived in 19] . 

In this paper we are interested in two problems concerning the JCIR defined in (1). 
The first one is to study the transition density estimates of the JCIR. Our first main 
result of this paper is a lower bound on the transition densities of the JCIR. Our idea 
to establish the lower bound of the transition densities is as follows. Like the BAJD, 
the JCIR is also an affine processes in R+. Roughly speaking, affine processes are 
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Markov processes for which the logarithm of the characteristic function of the process 
is affine with respect to the initial state. Affine processes on the canonical state space 
X have been investigated in [3, 5, 13, 14]. Based on the exponential-affine 
structure of the JCIR, we are able to compute its characteristic function explicitly. 
Moreover, this enables us to represent the distribution of the JCIR as the convolution 
of two distributions. The first distribution is known and coincides with the distribution 
of the CIR model. However, the second distribution is more complicated. We will 
give a sufficient condition such that the second distribution is singular at the point 0. 
In this way we derive a lower bound estimate of the transition densities of the JCIR. 

The other problem we consider in this paper is the exponential ergodicity of 
the JCIR. According to the main results of [10] (see also [12]), the JCIR has a 
unique invariant probability measure tt , given that some integrability condition on 
the Levy measure oi > 0) is satisfied. Under some sharper assumptions we 
show in this paper that the convergence of the law of the JCIR process to its invariant 
probability measure under the total variation norm is exponentially fast, which is 
called the exponential ergodicity. Our method is the same as in [9], namely we show 
the existence of a Forster-Lyapunov function and then apply the general framework 
of [16-18] to get the exponential ergodicity. 

The remainder of this paper is organized as follows. In Sect. 2 we collect some 
key facts on the JCIR and in particular derive its characteristic function. In Sect. 3 we 
study the characteristic function of the JCIR and prove a lower bound of its transition 
densities. In Sect. 4 we show the existence of a Forster-Lyapunov function and the 
exponential ergodicity for the JCIR. 



2 Preliminaries 

In this section we use the exponential- affine structure of the JCIR process to derive 
its characteristic functions. 

We recall that the JCIR process {Xt , ^ > 0) is defined to be the solution to (1) and 
it depends obviously on its initial value Xq. From now on we denote by (X^ ,t >Q) 
the JCIR process started from an initial point x > 0, namely 



Since the JCIR process is an affine process, the corresponding characteristic func- 
tions of {X^ ,t > 0) is of exponential-affine form: 



dXf = a{6 - Xf)dt + ayfxfdW, +dJ,, X 




( 3 ) 



£[e“V] = :={u€C:dtu<0], 



( 4 ) 
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where denotes the real part of u and the functions (j){t,u) and \lr{t,u) in turn are 
given as solutions of the generalized Riccati equations 

= F[f{t,u)), 0(0, w) = 0, 

dfij/it, u) = R(0(^, m)), 0(0, u) = u e ^ , 

with the functions F and R given by 



F(u) = aOu + / — l)v(J§), 

J (0,oo) 

2 2 

R(u) = au. 

2 

Solving the system (5) gives (j)(t,u) and in their explicit forms: 



0(^, u) 






and 



( 6 ) 



2a9 , 

4>{t,u) = ^log(l- 



f [ (7) 

Jo J(0,oo) ^ 



Here the complex- valued logarithmic function log(-) is understood to be its main 
branch defined on C \ {0}. For t >0 and u e ^ we define 

(P\{t,u,x) - —u{\ - e 0) -^^expl ), 

2a Vl _ _e-ar)/ 

(P 2 (t, u) := exp (iT(^ 



where the complex-valued power function ^ _ (2a0la^) logz) is 

also understood to be its main branch defined on C \ {0}. One can notice that (p 2 (t, u) 
resembles the characteristic function of a compound Poisson distribution. 

It follows from (4), (6) and (7) that the characteristic functions of {X^ , r > 0) is 
given by 

= (pi{t,u,x)(p2{t,u), M G (9) 



where u, x) and (p 2 (t, u) are defined by (8). 

According to the parameters of the JCIR process we look at two special cases: 
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2.1 Special Case (i): v = 0^ No Jumps 



Notice that the case v = 0 corresponds to the classical CIR model (Yt,t > 0) 
satisfying the following stochastic differential equation 

JFf = a(e - Y^)dt + a^dWt, = x > 0. (10) 

It follows from (9) that the characteristic function of {Y^ , ^ > 0) coincides with 
(pi(t, u, x), namely forue^ 



E[e‘*^‘] = <pi{t,u,x). (11) 

It is well known that the classical CIR model (Y^ , t > 0) has transition density 
functions f(t,x,y) given by 



q_ 

f(t,x,y) = lg{2(uv)^) 



( 12 ) 



for r > 0, X > 0 and y > 0, where 

2a 



!^1 _ e-at^ ’ 



V = Ky, 



2a0 



and Iq (•) is the modified Bessel function of the first kind of order q. For x = 0 the 
formula of the density function /(r, x, y) is given by 



/(r,0, y) = v^e~^ 



(13) 



for r > 0 and y > 0. 



2.2 Special Case (ii): 0 = 0 and x = 0 

We denote by (Z^, ^ > 0) the JCIR process given by 



dZt = —aZtdt -\- cf^f^tdWt -\- dJt, Zq = 0. (14) 



In this particular case the characteristic functions of (Z^ , r > 0) is equal to ^2 (C ^) , 
namely for w g 



£’[^“^^1 = (flit, u). 



(15) 
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3 A Lower Bound for the Transition Densities of JCIR 



In this section we will find some conditions on the Levy measure v of ^ > 0) 
such that an explicit lower bound for the transition densities of the JCIR process 
given in (3) can be derived. As a first step we show that the law of Xf , t > 0, in (3) 
is absolutely continuous with respect to the Lebesgue measure and thus possesses a 
density function. 

Lemma 1 Consider the JCIR process (Xf , t > 0) (started from v > 0) that is 
defined in (3). Then for any t > f) and x > 0 the law of Xf is absolutely contin- 
uous with respect to the Lebesgue measure and thus possesses a density function 
p(t,x, y), y > 0. 

Proof As shown in the previous section, it holds 

u,x)(p2(t, u) = E[e'*^‘ 

therefore the law of Xf , denoted by pix ^ , is the convolution of the laws of Yf and Zt . 
Since (Y^ ,t >f)) is the well-known CIR process and has transition density functions 
f(t,x,y),t > 0, X, y > 0 with respect to the Lebesgue measure, thus pixf is also 
absolutely continuous with respect to the Lebesgue measure and possesses a density 
function. 

In order to get a lower bound for the transition densities of the JCIR process we 
need the following lemma. 

Lemma 2 Suppose that § ln(l/§)v(J§) < oo. Then (p 2 defined by (8) is the 
characteristic function of a compound Poisson distribution. In particular, P(Zt = 
0) > 0 for all t > 0, where (Zt,t > 0) is defined by (\ A). 

Proof It follows from (6), (8) and (15) that 
= nit, u) = exp ^ (exp 

where u e Define 

A := f f (exp ( , 

Jo 7(0, CO) VI - (or2/2a)(l - e / 



If we rewrite 



exp 



(t3 



(cr2/2a)(l - 



m) 




( 16 ) 
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where 



a \— 



2a^ 



> 0 , 



- 1 ) 

2ae^^ 

P := > 0 , 



(17) 



then we recognize that the right-hand side of (16) is the characteristic function of a 
Bessel distribution with parameters a and p. Recall that a probability measure iia,^ 
on (R+, =^(R+)) is called a Bessel distribution with parameters a and P if 



l^a,p(dx) = e ^ 8 o(dx) -h pe 



-a-^x 



apx )ax, 



(18) 



where 8q is the Dirac measure at the origin and I\ is the modified Bessel function of 
the first kind, namely 






U2\^ 









r G R. 



For more properties of Bessel distributions we refer the readers to [8, Sect. 3] (see 
also [4, p. 438] and [9, Sect. 3]). Let jla,^ denote the characteristic function of the 
Bessel distribution with parameters a and P which are defined in (17). It follows 
from [9, Lemma 3.1] that 



= exp = exp (y 



— (a^l2a){\ — e~^^)u 



)■ 



Therefore 



= / / (A«,^(m) - \)v{d^)ds 

Jo J(0,oo) 

= [ f (e^ -e-“ +e~“ -l)v{d^)ds. 
Jo i(0,oo) '' ^ 



Set 



v(d^)ds 



X:=fl ( 1-0 

Jo J(0,oo) 

n (l - e~^^^^)]v(d^)ds. (19) 

9,00) ^ ^ 
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If A < 00 , then 



A= [ [ (e^ - e-^)v(d^)ds - X 

Jo J(0,oo) ^ 

= d~ f f {e^ -e-^)v(d^)ds -l). 

Jo J(0,oo) ' 

The fact that X < oo will be shown later in this proof. 

Next we show that the term (1 /A.) (exp (au/(j3—u))—exp(—a))v(d$)ds 

can be viewed as the characteristic function of a probability measure p. To define p, 
we first construct the following measures 



ma,fi(dx) := pe “ J 



X > 0, 



where I\ is the modified Bessel function of the first kind. Noticing that the measure 
is the absolute continuous component of the measure in (18), we easily 
get 

au 



where ma,^(u) := e^^ma,^(dx) forue^. Recall that the parameters a and ^ 

defined by (17) depend on the variables § and 5'. We can define a measure p on R+ 
as follows: 



P := 7 / [ rua,^ v{d^)ds. 
X Jo 7(0, oo) 



By the definition of the constant A in (19) we get 



P(R+) = 



1 

A 

1 

A 



/'/ 

Jo 7(( 

/'/ 

Jo 7(( 



ma,p{^+)v{d^)ds 
(1 — e~^)v(d^)ds 



= 1 , 



i.e. p is a probability measure on R+, and for w g 



p(u) = / p{dx) 

7(0, oo) 



=U'I < 

^ Jo 7(0, 00) 



-e~^)v(d^)ds. 



Thus A = X(p(u) — 1) and E[e^^^] = is the characteristic function of a 

compound Poisson distribution. 
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Now we verify that X < oo. Noticing that 



)v(d^)ds 



X = ff 

Jo 7(0, 00 ) 

n (l - e~^^^^Av(d^)ds 

3,00) ^ 

=/ /'( 

J(0,oo) Jo ^ 



2a^ 



^dsv(d^), 



we introduce the change of variables 



dy = - 



2a^ 



cr^^e^s _ 

2a^ 



:= y and then get 



= 



_ \y. 

) cr^ /2a^ 



ae^^ds 



i^^l)ds. 
Va^y J 



Therefore 



2a^ 

X= [ v(d^) 

J {0,OO) J OQ 

= [ V{d^) f 

J(0,co) J- 



-2t 



2a^y + cr^y^ 



dy 



M 2a^y + a^y^ 

cr^(e^^ — l) 



dy 



= f v(d^) f 

J(0,oo) J% 



v(d^) I (1 - e-y) 



2^ 



2a^y + cr^y^ 



dy, 



where 8 := . Define 



2a 



-i 



M{^):= I (l-e-y) 



2^ 



ydy. 



2a^y + cr^y^ 

Then M(§) is continuous on (0, oo). As § ^ 0 we get 

2? , , o. r.. -V. dy 



M{^)= I (l-e-y) 



i: 

8 

'J T 



J 



I la^y + a'^y'^ 
+ (J^y 



dy + 2^ 



/ OO 

1 



2a^y + G^y^ 2a^y + a^y^ 



i: 



2a^y + a^y^ 



dy 



/•' 1 /•~ 1 
L o 2 + / — 

Jl 2a^+(T^y Jl a^y 



;dy. 
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Since 






2a^ -|- 



+ cr^y 



dy = ^ 

L 



ln(2a^ + cr^y) 



2$ 9 

= — T \n(2a^ + a^) ^ ln(2a^ H — ) 

(7^ 6 

1 1 

< ci§ + C 2 § In(-) < C 3 § In(-) 

? ? 



for sufficiently small § , we conclude that 



1 

M(§) < C 4 ?ln(-), as § ^ 0. 

§ 



If § ^ 00 , then 



M(§) < / (1 - 



/i 

/. 






2a^)> + 



dy 



2t 



i 2a%y 

h 

roo 



Jy < 



A 



rJy 



Therefore, 



A- < C4 



1 2§r 

d(—) = M -- 
J| y yJ 



2t 5 28 

-^- = ^ :=C5 < 00. 
(T^ § 



^^§ln(l)v(J|)+C5 r lv(J§) < 00. 



With the help of the Lemma 2 we can easily prove the following proposition. 

Proposition 1 Let p(t, x, y), t > 0, x, y > 0 denote the transition density of the 
JCIR process (Xf , t > 0) defined in (3). Suppose that f^Q § ln(|)v(J§) < oo. 
Then for all t > 0, x, y > Owe have 



pit, X, y) > P{Zt = 0)/(L X, y), 

where P(Zt = 0) > 0 for all t > 0 and f(t,x,y) are transition densities of the CIR 
process (without jumps). 

Proof According to Lemma 2, we have P(Zt = 0) > 0. Since 



E[e^^t ] = (pi(t, u, x)(f 2 it, u) = E{e^^^ ]E[e^^^], 
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the law of Xf, denoted by is the convolution of the laws of and Z^. Thus 
for all A e ^(R+) 



IJ-xf(A)= I^Y,’^(A-y)nz,(dy) 
J R-i- 

> / iiyyiA- y)iiz,{dy) 

Jm 

> ^lY^{A)^lz,{{^}) 
>P(Z,= 0)hyHA) 



> P(Z,=0) [ f{t,x,y)dy, 

Ja 

where f(t,x,y) are the transition densities of the classical CIR process given in 
(12). Since A e ^(R+) is arbitrary, we get 



p(t, X, y) > P{Zt = 0)/(C X, y) 



for all r > 0, X, y > 0. 



4 Exponential Ergodicity of JCIR 

In this section we find some sufficient conditions such that the JCIR process is 
exponentially ergodic. We have derived a lower bound for the transition densities of 
the JCIR process in the previous section. Next we show that the function V (x) = x, 
X > 0, is a Forster-Lyapunov function for the JCIR process. 

Lemma 3 Suppose that ^v(d$) < oo. Then the function V(x) = x, x > 0, 
is a Forster-Lyapunov function for the JCIR process defined in (3), in the sense that 
for all t > 0, X > 0, 

^[V(Xf)] < e~^W{x)FM, 
where 0 < M < oo is a constant. 

Proof We know that [ixf = l^yf ^ P'Zt^ therefore 

E[Xf] = E[Y,^]FE[Ztl 

Since (Yf , t > 0) is the CIR process starting from x, it is known that pLyf is a 
non-central Chi-squared distribution and thus E[Yf] < oo. Next we show that 
E[Zti < 00 . 

Let u G (— 00 , 0). By using Fatou’s Lemma we get 
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r - 1 -| 

E[Zt] = e\ Urn 

\-u — >-0 li J 



< liminf E 









lim inf 

u^O U 



Recall that 



£[e»2,] ^ m) = exp ( / / (e i-(-^/Za-e-“^)u _ l)v{d^)ds) = 

^Jo J{0,oo) ^ ^ 



Then we have for all w < 0 



d 

du 



(exp(j 



^ue 



— (a^/2a)(l—e 

^e~as 



H 



< 



(1 - (cr2/2a)(l -e-^Ow)' 

^e~as 

(1 - (a^ /2a){l - e-“^)uy 



rexp 



(t 3 



^ue 



i(T^I2a)(l - 



m) 



< ^e- 



and further 



/7 ■ 

Jo J{0,oo) 



^e~^^v{d^)ds < 00 . 



Thus A{u)\^ differentiable in u and 



Zi'(O) = f [ He~‘^^v{d^)ds = j 

Jo 7(0, oo) ^ 7(0, oo) 



It follows that 



E[Zt] < liminf 

u — >-0 



ip2it, U) - (P2(t, 0) 



d(p 2 it, u) 



du 
1 - 

a 



u=0 



u 



L 



(0,oo) 



^v{d^). 



Therefore under the assumption / ) < oo we have proved that E[Zf] < 

7(0, oo) 

00 . Furthermore, 



£[Z,] = 

du\ / 



I — e 



u=0 



[ $v(d$). 

^ 7(0,oo) 
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On the other hand, 

] = (1 - {cf^/la)u{\ - gxp 

With a similar argument as above we get 



(- 



xue 



(cr2/2a)M(l -e-“0 



)■ 




= 6{\ - e~^^) ^ xe~^^ . 



Altogether we get 



] = E[Y^] + E[Zt] 

1 - 

= (1 - e -^^){6 + ) + xe ~^^ 

a 

<e \-xe~^\ 

a 



namely 

E[V{X^)] < 6> + - + e~^W(x). 
a 

Remark 1 If $v(d^) < oo, then there exists a unique invariant probability 
measure for the JCIR process. This fact follows from [12, Theorem 3.16] and [10, 
Proposition 3.1]. 

Let II • II 7 y denote the total-variation norm for signed measures on R+, namely 

WiiWtv = sup {|/x(A)|}. 

A€^(R+) 

Let P\x,') \= P (Xf G •) be the distribution of the JCIR process at time t started 
from the initial point v > 0. Now we prove the main result of this paper. 

Theorem 1 Assume that 

f § < 00 and f § ln(-)v(J§) < oo. 

7(1,00) 7(0,1) ? 

Let 7t be the unique invariant probability measure for the JCIR process. Then the 
JCIR process is exponentially ergodic, namely there exist constants 0 < f < \ and 
0 < B < oo such that 

\\P‘{x,-) -tzWtv < B{x + , t>0, xeR+. (20) 

Proof Basically, we follow the proof of [18, Theorem 6.1]. For any 5 > 0 we 
consider the 5 -skeleton chain := X^^, ^ g Z+, where Z+ denotes the set of all 
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non-negative integers. Then is a Markov chain on the state space R+ with 

transition kernel P^(x, •) and starting point = x. It is easy to see that the measure 
7T is also an invariant probability measure for the chain (r]^)neZ+, x > 0. 

Let V (x) = x,x >0. It follows from the Markov property and Lemma 3 that 

f V{y)P\nl,dy) < e-^^V{r^D + M, 

JR+ 

where M is a positive constant. If we set Vo := V and Vn := V ^ G N, then 

E[Vi] < e-^^Voix) + M 
and 

neN. 

Now we proceed to show that the chain (r]^)neZ+,x > 0, is A. -irreducible, strong 
aperiodic, and all compact subsets of R+ are petite for the chain (r]^)neZ+- 

''X-irreducibility'': We show that the Lebesgue measure X on R+ is an irreducibil- 
ity measure for (r]f^)neZ+- Let A e ^(R+) and X(A) > 0. Then it follows from 
Proposition 1 that 



P[Vi e A\r]^o =x] = P(Xl e A) > P(Zs = 0) ^ f(8, x, y)dy > 0, 

since f(8,x,y) > 0 for any x G R+ and y > 0. This shows that the chain (r]^)neZ+ 
is irreducible with X being an irreducibility measure. 

''Strong aperiodicity'\sQQ [16, p. 561] for a definition): To show the strong aperi- 
odicity of {r]^)neZQ, we need to find a set 5 G =^(R+), a probability measure m with 
m{B) = 1, and £ > 0 such that 

L(x,5)>0, xgR+, (21) 

and 

P{y]\ e A) >sm{A), X G C, A g ^(R+), (22) 

where L(x, B) := P{r]^ G B for some n G N). To this end set B := [0, 1] and 
^(y) := inf;ce[0,l] /(^. y > 0. Since for fixed y > 0 the function /(5, x, y) 
is strictly positive and continuous in x G [0, 1], thus we have g(y) > 0 and 0 < 
/( 0 ,i]g(yWy < 1- Define 

•= 7 Vt:T [ Siy)dy. A e ^(R+). 

J( 0 ,i] 8(y)dy 7/in(0,i] 
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Then for any x G [0, 1] and A e ^(R+) we get 



P{r^\ e A) = P{Xl e A) 
> P(Zs = 0) 


f f(S,x,y)dy 


> P(Zs = 0) 


'a 

/ g{y)dy 




'/in(0,i] 



> P{Zs = O)m(A) [ g(y)dy, 
^( 0 , 1 ] 



so (22) holds with s := P(Zs = 0) g(y)dy. 

Obviously 

L(x, [0, 1]) > P(r]\ G [0, 1]) = G [0, 1]) > P(Zs = 0) [ f(8, x, y)dy > 0 

^[0,1] 

for all X G R+, which verifies (21). 

''Compact subsets are petite'': We have shown that X is an irreducibility measure 
for (^^)neZ+- According to [16, Theorem 3.4(ii)], to show that all compact sets are 
petite, it suffices to prove the Feller property of {r]^)neZ^, x > 0. But this follows 
from the fact that {r]^)neZ+ is a skeleton chain of the JCIR process, which is an affine 
process and possess the Feller property. 

According to [16, Theorem 6.3] (see also the proof of [16, Theorem 6.1]), the 
probability measure tt is the only invariant probability measure of the chain (^^ )neZ+ , 
X > 0, and there exist constants ^0 G (0, 1) and C G (0, oo) such that 

||P^"(x, •) -Trllrv < c(x + n e Z+, x e R+. 

Then for the rest of the proof we can proceed as in [18, p. 536] and get the 
inequality (20). 
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Optimal Control of Predictive Mean-Field 
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Abstract We study a coupled system of controlled stochastic differential equations 
(SDEs) driven by a Brownian motion and a compensated Poisson random measure, 
consisting of a forward SDE in the unknown process X (t) and a predictive mean-field 
backward SDE (BSDE) in the unknowns Y (0, Z(t), K (t, •)• The driver of the BSDE 
at time t may depend not just upon the unknown processes Y(t), Z(t), K(t, •), but 
also on the predicted future value Y (t + 5), defined by the conditional expectation 
A(t) := E[Y(t-\-8)\^t]’ We give a sufficient and a necessary maximum principle for 
the optimal control of such systems, and then we apply these results to the following 
two problems: (i) Optimal portfolio in a financial market with an insider influenced 
asset price process, (ii) Optimal consumption rate from a cash flow modeled as a 
geometric Ito-Levy SDE, with respect to predictive recursive utility. 
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1 Introduction 



The purpose of this paper is to introduce and study a pricing model where beliefs 
about the future development of the price process influence its current dynamics. We 
think this can be a realistic assumption in price dynamics where human psychology 
is involved, for example in electricity prices, oil prices and energy markets in general. 
It can also be a natural model of the risky asset price in an insider influenced market. 
See Sect. 5.1. 

We model such price processes as backward stochastic differential equations 
(BSDEs) driven by Brownian motion and a compensated Poisson random measure, 
where the coefficients depend not only of the current values of the unknown processes, 
but also on their predicted future values. These predicted values are expressed mathe- 
matically in terms of conditional expectation, and we therefore name such equations 
predictive mean-field equations. To the best of our knowledge such systems have 
never been studied before. 

In applications to portfolio optimization in a financial market where the price 
process is modeled by a predictive mean-field equation, we are led to consider coupled 
systems of forward-backward stochastic differential equations (FBSEDs), where the 
BSDE is of predictive mean-field type. In this paper we study solution methods for 
the optimal control of such systems in terms of maximum principles. Then we apply 
these methods to study 

(i) optimal portfolio in a financial market with an insider infiuenced asset price 
process. (Sect. 5.1), and 

(ii) optimal consumption rate from a cash flow modeled as a geometric Ito-Levy 
SDE, with respect to predictive recursive utility (Sect. 5.2). 



2 Formulation of the Problem 

We now present our model in detail. We refer to [5] for information about stochastic 
control of jump diffusions. 

Let 5(0 = B{t, (jo)\ {t, co) G [0, oo)x^2 and (Jr, dt;) = N(dt, d^) — v(d^)dt 
be a Brownian motion and an independent compensated Poisson random measure, 
respectively, on a filtered probability space (^2, E, F = {^t}t>0, P) satisfying the 
usual conditions. We consider a controlled system of predictive (time- advanced) 
coupled mean-field forward-backward stochastic differential equations (FBSDEs) 
of the form (T > 0 and 5 > 0 are given constants) 

• Forward SDE in X(t): 

dX{t) = dX^(t) = bit, X(t), Y(t), A(t), Z(t), K(t, •), u(t), co)dt 
+G{t, X(t), Y(t), Ait), Z(0, Kit, •), uit), co)dBit) 

+ Ir y(A Kit), Yit), Ait), Z(0, Kit, •), uit), 0 co)Nidt, di;) \ t e [0, T] 

^X(O) 
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• Predictive BSDE in Y{t), Z(t), K(t): 



dY(t) = -g(t, X(t), 7(0, A(0, Z(0, K(t, •), u{t), co)dt + Z{t)dB{t) 
+ t€[0,T) 

Y(T) =h{X(T),co). 



Y(t):= L; t e(T,T + S], (2) 

where L is a given bounded ^-measurable random variable, representing a “ceme- 
tery” state of the process 7 after time T. The process A(t) represents our predictive 
mean-field term. It is defined by 

Mt) := E{Y(t + 5) I ^0 ; t e [0, Tl (3) 

Here ^ is the set of functions from Mq •= into M, /z(x, a;) is a function 

(with respect to x) from R x into M such that h(x, •) is -measurable for all x, 
and 



g:[0,T]xRxRxRxRx^xVxf2^R 

is a given function (driver) such that g(t, x, y, a, z, k, u, ') is an F-adapted process 
for all X, y, (2, z eR,k e ^ and u eV, which is the set of admissible control values. 
The process u(t) is our control process, assumed to be in a given family ^ 
of admissible processes, assumed to be cadlag and adapted to a given subfiltration 
G = {%}t>Q of the filtration F, i.e. % c for alU. The sigma-algebra represents 
the information available to the controller at time t. 

We assume that for all w g ^ the coupled system (l)-(3) has a unique solution 
X{t) = Z“(r) e L^{m x P),Y{t) = F“(0 € L^{m x P), A(r) = A“(r) € 
L^{m X P), Z{t) = Z“{t) e L^(m x P), K(t, O = O e L^{m x v x P), 

with X(t), Y(t), A(t) being cadlag and Z(t), K(t, being predictable. Here and 
later m denotes Lebesgue measure on [0, 7]. 

To the best of our knowledge this system, (l)-(3), of predictive mean-field FBS- 
DEs has not been studied before. However, the predictive BSDE (l)-(3) is related to 
the time-advanced BSDE which appears as an adjoint equation for stochastic con- 
trol problems of a stochastic differential delay equation. See [7] and the references 
therein. 

The process A(t) models the predicted future value of the state 7 at time t 
Therefore (l)-(3) represent a system where the dynamics of the state is influenced by 
beliefs about the future. This is a natural model for situations where human behavior 
is involved, for example in pricing issues in financial or energy markets. 
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The performance functional associated to w G is defined by 



J(u) = E 




, A(0, T(0, A(0, u{t), co)dt + (p(X(T), co) + iA(F(0)) 



(4) 



where f : [0, T] x R x R x V x Q ^ R, cp : R x Q ^ M. and \j/ : R ^ R 
are given functions, with f(t,x,y,a,u, •) being F-adapted for all x, y, <2 G M, 
w G U. We assume that (p{x, •) is -measurable for all x. 

We study the following predictive mean-field stochastic control problem: 

Find M* G ^ such that 



sup J(u) = J(u*). (5) 

ue^ 



In Sect. 3 we give a sufficient and a necessary maximum principle for the optimal 
control of forward-backward predictive mean-field systems of the type above. 

An existence and uniqueness result for predictive mean-field BSDEs is given in 
Sect. 4. 

Then in Sect. 5 we apply the results to the following problems: 

• Portfolio optimization in a market where the stock price is modeled by a predictive 
mean-field BSDE, 

• Optimization of consumption with respect to predictive recursive utility. 



3 Solution Methods for the Stochastic Control Problem 
3.1 A Sufficient Maximum Principle 



Eor notational simplicity we suppress the dependence oi coin f, g,h, cp and \l/ in the 
sequel. We first give sufficient conditions for optimality of the control u by modifying 
the stochastic maximum principle given in, for example, [6], to our new situation: 
We de^ne the Hamiltonian H : [0, T] xMxMxMxMx^^xUxMxMxMxM) ^ 
M associated to the problem (5) by 



H(t, X, y, a, z, k, u, p, q, r, X) = f(t, x, y, a, u) + b{t, x, y, a, z, k,u)p a(t, x, y, a, z, k, u)q 

+ / y{t,x,y,a,z,k,u,i;)N{dt,di;) g{t,x,y,a,z,k,u)X. 

( 6 ) 



We assume that f, b, a, y and g, and hence H, are Erechet differentiable (C^) in 
the variables x,y,a, z,k,u and that the Erechet derivative of H with respect 
to k G as a random measure is absolutely continuous with respect to v, with 
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Radon-Nikodym derivative . Thus, if {VkH, h) denotes the action of the linear 



dv 



operator on the function h e ^ wo have 




(7) 



The associated backward-forward system of equations in the adjoint processes 
p(t), q(t), r(t), X(t) is defined by 

• BSDE in p(t), q(t), r(t): 

t dH r ^ 

dp{t) = - — {t)dt q{t)dB{t) ^ / r{t , t;)N {dt , dt;) \ 0<t<T 

dx Jm ( 8 ) 

p(T) =(p\X(T))+X(T)h\X(T)). 

• SDE in X(t): 



where we have used the abbreviated notation 

H(t) = H(t, X(t), Y(t), A(t), Z(t), K(t, •), u(t), p(t), q(t), r(t), 

Note that, in contrast to the time advanced BSDE (l)-(3), (9) is a (forward) stochastic 
differential equation with delay. 

Theorem 1 (Sufficient maximum principle) Let u e ^ with corresponding solution 
X(t), F(0, A(0, Z(0, Kit, •), pit), qit). fit). Ht) of(l)-(3), (8) and (9). Assume 
the following: 






f dVkH 

+ / — ON(dt, dO\ 0 < r < r 



(9) 



7r dv 

A(0) = 



X{T) > 0 



( 10 ) 



• For all t, the functions 




( 11 ) 



are concave 
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• For all t the following holds, 

(The conditional maximum principle) 

ess sup E[H(t, X(t), Y(t), A(t), Z(t), K(t, •), v, i(t), p(t), q(t), r(t, •)) I %] 

veU 

= E[H(t, X(t), 7(0, A(0, Z(0, K(t, •), u(t), i(t), p(t), q(t), r(t, 0) I ^0 ; t e [0, T] 

( 12 ) 






dVkHjt, .) 
dv 



< oofor all t G [0, T]. 



(13) 



Then u is an optimal control for the problem (5). 

Proof By replacing the terminal time T by an increasing sequence of stopping times 
Tn converging to T as ^ goes to infinity, and arguing as in [6] we see that we may 
assume that all the local martingales appearing in the calculations below are martin- 
gales. 

Much of the proof is similar to the proof of Theorem 3.1 in [6], but due to the 
predictive mean-field feature of the BSDE (l)-(3), there are also essential differences. 
Therefore, for the convenience of the reader, we sketch the whole proof: 

Choose u e ^ and consider 



J (u) — J (u) — 7i + /2 + /s, 



(14) 



with 



h :=E 



/ 



{fit) -fit)} dt, l 2 :=E[cp{X{T))-cp{X(T))l I 3 := Jf iY (0)) - (Y (0)) , 

where /(O = fit, fit), Ait), u it)) etc., mid fit) = Y^it) is the solution of (l)-(3) 
when u = u, and Ait) = E[f it) \ 

By the definition of H we have 



h 



= E 




{Hit) - Hit) - pit)bit) - qit)dit) 
r(f, - X(r)g(0 , 



( 16 ) 



where we from now on use the abbreviated notation 



Hit) = Hit, Xit), Yit), Ait), Zit), Kit, •), uit), if)) 
Hit) = Hit, Xit), fit). Ait), Zit), Kit, •), uit), if)) 
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and we put 



b(t) := b(t) — b(t), 

and similarly with X(0 \= X(t) — X{t),Y(t) \= Y (t) — Y (t) , A(t) \=A{t) — A{t), 
etc. 

By concavity of (p, (9) and the Ito formula, 

h < E[ip\X(T))X{T)] 

= E{p{T)X{T)^ - E{i{T)h\X{T))X{T)^ 

rT nT nT 

p(t~)dX(t) A- / X(t~)dp{t) + / q{t)d{t)dt 



Au: 



']) 



= E 



+ / [ Ht, Or(t, OHd^dt I ) - e[UtW(X(T))X(T)] 
Jo Jr 

£ p{t)b{t)dt + £ X(t)^- 

+ [ q(t)d(t)dt A- [ [ r(C< 

Jo Jo Jr 



dH 






Oy(t,Ov{dOdt 

/O 

- E[UT)h'{X{T))X(T)]. 

By concavity of xj/ and h, (10) and the Ito formula we have 



(17) 



h<E 



xlr\Y(0))Y(0) 



= ^1A(0)7(0)] 



E[X(T)Y(T)] - E 



[ i(t)dY(t) A- [ 

Jo Jo 



X(t)dY(t)A- / Y(t)dX(t)A- / d[Y,X](t) 



f 



E[X{T){h{X{T))-h{X(Tm 
E\ f X(t)dY(t)+ [ Y(t)dX(t)+J d[Y,X]{t) 



f X(t)dY(t)+ f 
Jo Jo 

< E[X{T)h'{X{T))X{T)] 



+ E 



[ X(t)g{t)dt + f Y(t) 

Jo Jo 

dH ~ f 

+ / ~^(t)^(t)dt + / 

Jo oz Jo Jr 



dH dH 

-^(0 - —(t - ^)X[8,T](t) 
dy da 



dt 



dVkH 

dv 



(r, ^)v(dOdt 



(18) 
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Adding (16), (17) and (18) we get, by (9), 



J (u) — J (u) — 7i + /2 + /s 

rT 



< E 



/ 



dH ~ dH ~ 

H(t) - H(t) - —X(t) - —Y(t) 
ax ay 



-{XkH(t, .), K(t, •)) 

aa az 



Note that, since Y(s) = Y (s) = L for s e (T,T + 8] by (1), we get 



dt . 
(19) 





' dH 




dH ~ 


E 




= E 


/ —(s)Y(s + S)ds 

Jo da 





rT-8 


~dH 








E 


/ ^ 


— (s)F(^ +.5) 1 


dt 






Jo 


da 










■ pT-S 3 A 




~ 




' dH 


E 


- 


-{s)E 


F(^ + 5) 1 


ds 


= E 


/ —{s)A{s)ds 




Jo 


L -I 






Jo da 



. ( 20 ) 



Substituted into (19) this gives, by concavity of H, 
J (u) — J (u) = I\ 12 13 

rT 



< E 



I 



H(t) - H{t) - - X{t)) - ^(Y{t) - Y{t)) 

ax ay 



dH ^ dH 

- - A(t)) - - Z(t)) 

aa az 



< E 



^ dH 



dt 



= E 



/ - U{t))dt 

Jo ou 

rT 97^ 

/ E[—(l)m(u(l)-u(l))dt 

Jo ou 

since u = u(t) maximizes 



< 0 , 



( 21 ) 



□ 
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3.2 A Necessary Maximum Principle 

We proceed to prove a partial converse of Theorem 1, in the sense that we give 
necessary conditions for a control u to be optimal. In this case we can only conclude 
that u(t) is a critical point for the Hamiltonian, not necessarily a maximum point. 
On the other hand, we do not need any concavity assumptions, but instead we need 
some properties of the set of admissible controls, as described below. 

Theorem 2 (Necessary maximum principle) Suppose u e ^ with associated solu- 
tions X,Y , Z, K , p, q,r /X of( l)-(3) and (8) and (9). Suppose that for all processes 
f(t) of the form 

^(0 := X[tQ,T]{t)a, (22) 

where t^ G [0, T) and a = ct(a)) is a bounded -measurable random variable, 
there exists 5 > 0 such that the process 

u{t) + rf{t) G for all r G [—8, 5]. 



We assume that the derivative processes defined by 

x(t) = xHt) = |,=o, (23) 

dr 

y(t) = yf^it) = |,=o, (24) 

dr 

a{t) = af^it) = |,=o, (25) 

dr 

2(0=z^(0 = ^Z“+''^(0 lr=0, (26) 

dr 

k{t) = kf^it) = |,=0, (27) 



exist and belong to L^(m x P), L?{m x P), L^(m x P), and L^(m x P x v), 
respectively. 

Moreover, we assume that x(t) satisfies the equation 
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dx{t) — 



+ 



db db db db 

— (t)x{t) + —{t)y{t) + —{t)a{t) + —it)z{t) + {^kb, k(t, ■)) 
ax ay aa az 

db 

+ dt 

au 

da da da da 

-—(t)x(t) + -—(t)y(t) + -—(t)a(t) + ^(0^(0 + (Vy^a, k(t, •)) 

dx dy da dz 

da 

dB(t) 



+■^( 0^(0 

du 



+ 



/ 

JR 



dy dy dy dy 

/(A 0^(0 + -fit, Oyit) + fit, Oait) + fit, Ozit) 

dx dy da dz 

+iVkr(t,o,k(t,-)) + ^(t,O^(0 

du 



xi0) = 0 
and that y it) satisfies the equation 



Nidt,dO ; t e[0, T] 
(28) 



= - [If (0x(0 + ^(i)y{t) + If (0a(0 + |f(0z(0 

+ {Vkg{t),k{t, ■)) + ^^{t)m]dt 
-\-zit)dBit) + fj^kit, t;)Nidt, dt;) ; 0 < r < r 
yiT) =h\XiT))xiT) 
y(t) =0; T < t < T -\- d. 



where we have used the abbreviated notation 



9^ 

dx 



it) 



d 

dx 



git, X, y, a, Z, k, u)x=x{t),y=Y{t),a=A{t),z=z{t),k=K{t),u=u{t) etc. 



Then the following, (i) and (ii), are equivalent: 
d . 

(i) -—Jiu + r/3)r=o = 0 for all f of the form (22) 
dr 

(ii) ^E[Hit,Yit),Ait),m,Kit),u,iit))^=i,^,f^^^ 

where if. A, Z, K, X) is the solution of(l), (3) and (9) corresponding tou = u. 

Proof As in Theorem 1 , by replacing the terminal time T by an increasing sequence 
of stopping times converging to T as n goes to infinity, we obtain as in [6] that 
we may assume that all the local martingales appearing in the calculations below are 
martingales. The proof has many similarities with the proof of Theorem 3.2 in [6], 
but since there are some essential differences due to the predictive mean-field term, 
we sketch the whole proof. For simplicity of notation we drop the hats in the sequel, 
i.e. we write u instead of u etc. 
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(i) (ii): We can write — J(u -\-r^) |r=o= h h where 

dr 



d 



■ rT 

/ fit, Y^^^^it), A^^^^it), Z^+^^it), K^^^^it), u(t) + rf(t))dt 

Jo 



Jr=0 



h = ^W(X^^^^(T))]r=0 

dr 

h = :^['Aa"“+''^(0))]r=0- 

dr 



By our assumptions on / and \j/ we have 
r rT 



h 



d/ 



^(t)x(t) + ^{t)y(t) + ^(t)a(t) + ^(t)z(t) 
ax ay aa az 

au 



(30) 



h = EW{X(T)x{m = E[p{T)x{T)] 

h = f\Ymy{Q) = A(0)>>(0). 



(31) 

(32) 



By the Ito formula and (28) 
r rT 



h = E{p(T)x(T)] 



e\ r p(t) 
Jo 



= E f p(t)dx(t) + f x(t)dp(t)+ [ d[p,x^{t) 

Jo Jo Jo 

db db db db 

— {t)x{t) + — (f)y(O + —{t)a(t) + —{t)z{t) 

3 x 3 y 3 a 3 z 



] 



db 



+ (Wkbit),kit,-)) + ^{t)fit) 
ou 



/ dH \ 

dt-\- v(0 I it)\dt 

Jo \ dx J 



■f 



+ / ^(0 



d <7 da da da 

-—it)x(t) + — (0y(0 + -—it)a(t) + ~—it)zit) 

dx dy da dz 



da 



+ (Vka(t),k(t,-)) + ^m(0 

du 



dt 



fh''- 






dy dy dy dy 

-fit, t;)xit) + -fit, Oy(0 + fit, Oait) + fit, Ozit) 

dx dy da dz 






+ < Xkyit,^),kit,') > -\--—it, Ofiit) 
du 



vid^)dt 



= e\I 



L 

■L 



x{t) 

dx 

^ ^d_b 
dy 



db da f dy dH 

^(t)pit) + ^it)qit)+ / fit,Orit,OvidO-^{t) 
Jr 



yit) 



dx 

da 
— ( 
dy 



■/ 

Jr 



+ / ait) 



( 0 p (0 + — ( 0 ^( 0 + / —it,0rit,0^{d0 

db da 



dx 

dy 

Ty^ 



'-^(t)p(t)+’—(t)q{t)+ [ ^(t,i)r{t,Ov(dO 
da da Jj^ da 



dx 

dt 

dt 



dt 
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/' 



+ / Z(t) 
cT 



= E [ x(t) 



db da f dy 

— it)p(t)+—{t)q{t)+ / —{t,Or(t,Ov{dO 
oz dz Jr dz 

+ [ [ + '^k(z(t)qit) 

Jo Jr 

+ / ^ky(t,Or(.t,Ov(d^))v(d^)dt\ 

Jr J 

[/’ 

+ /%(0 

Jo 

+ f a(t) 

Jo 

+ f z(t) 

Jo 

i7. 

i 



dt 



-%{t) - X(0^(o| dt 
dx dx J 

dH df dg 

— (0-/(0-A(0/(0 

dy dy dy 

dH df dg 

— (0-^(0-A(0/(0 

da da da 

dH df dg 

dz dz dz 



dt 

dt 

dt 



+ / / k(t, 0{^kH(t) - Vkf(t) - X{tmg{t)]v{dOdt 

/ffi 
rT 



= -h-E 



m 

7' 



dH df dg 

— (0-^(0 
du du du 



dt 



m 



dg dg dg 

/(0x(0 + /(0y(0 + /(0z(0 

dx dy dz 



dg. 



+{^kgit),k{t,-)) + ^{t)fi{t) 
du 



dt 



+ E[f 

By the Ito formula and (29), 



k(t, •)> + \ dt I (33) 



h = X(0)yi0) = E X{T)y{T)-{^j^ X(t)dy(t) + y(t)dX(t) + j^ y](0^ 



= E[X{T)y{T)] 




dg dg dg 

-^(t)y(t) - ^(t)a(t) - ^(t)z(t) 
dy da dz 



-(Vkg(t),k(t,-)}- ^(t)j8(t) 
du 



dt 



y dH dH y dH 

+ / yit) — {t)dt + y{t) — {t-&)x[&,T}{t)dt+f z(t) — (t)dt 

Jo dy da Jo 

rT 



dz 



- / / k(t, 

Jo Jr 



O^kHit.OvidOdt 



])■ 



(34) 
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Adding (30), (33) and (34) and using that 

dH dH 

E[ y(t) — (t-&)X[S,T]dt] = E[ + (5) — 

Jo da Jo da 

nT pT 

= E[ —(s)E[y(s+S)\^s]ds] = E[ y{t) — (s)a(s)ds], (35) 

Jo da Jo da 



we get 



~—J{u + r^) |r=o— h + h — E 
dr 




dH 

du 






We conclude that 



^J(u + r^) |r=o= 0 
dr 



if and only if 



E 




— {t)^{t)dt 
du 



= 0 for all bounded P G ^ of the form (12.3.17). 



Since this holds for all such p, we obtain that if (i) holds, then 




dH 

du 



(t) I % 



dt = 0 for all to G [0, T). 



(36) 



Differentiating with respect to to and using continuity of (t), we conclude that 

du 

(ii) holds. 

(ii) (i): This is proved by reversing the above argument. We omit the details. □ 



4 Existence and Uniqueness of Predictive Mean-Field 
Equations 



In this section we study the existence and uniqueness of predictive mean-field BSDEs 
in the unknowns Y (t), Z(t), K (t, f ) of the form 

'dY(t) = -g(t, Y(t), A{t), Z{t), K{t, •), co)dt + Z(t)dB(t) 

+ J^K(t,ON(dt,d;)-,te[0,T) 

Y{t) = L; t g[T,T +8]; 8 >0 fixed, 



( 37 ) 
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where L g L^(P)isa given -measurable random variable, and the process A{t) 
as before is defined by 



Mt) = E{Y{t + 5) I ; t e [0, Tl (38) 

To this end, we can use the same argument which was used to handle a similar, but 
different, time-advanced BSDE in [7]. For completeness we give the details: 

Theorem 3 Suppose the following holds 

rT 

E[ / g^{t, 0, 0, 0, 0)dt] < 00 (39) 

Jo 

There exists a constant C such that 

\g{t,yi,ai,zi,ki) - g{t.y2,a2,Z2,k2)\ < C(|ji - y 2 \ + \zi - Z2\ + 

if \h(0 - k2(0\MdO)h (40) 

Jr 

for all t G [0, r], a.s. Then there exists a unique solution triple (Y (t), Z(t), K(t, ^)) 
of (37) such that the following holds: 

Y is cadlag and £’[sup^^|^Q T^(0] < cxd, 

Z, K are predictable and E[ {Z^(t) -h E^(t, ^)v(d^)jdt] < oo. 

Proof We argue backwards, starting with the interval [7 — 5,7]: 

Step 1. In this interval we have A{t) = E[L\^t] and hence we know from the theory 
of classical BSDEs (see e.g. [8, 9] and the references therein), that there exists a 
unique solution triple (Y{t), Z(t), K{t, t;)) such that the following holds: 

Y is cadlag and 7’[sup^^[j_5 7^(01 < c>o, 

Z, K are predictable and E[ fr^_^{Z^(t) -h K^(t, ^)v(d^)}dt] < oo. 

Step 2. Next, we continue with the interval [7 — 25, 7 — 5]. For t in this interval, the 
value of 7 (^ + 5) is known from the previous step and hence A(t) = E[Y (t S)\^t] 
is known. Moreover, by Stepl the terminal value for this interval, 7(7 — 5), is known 
and in L^(P). Hence we can again refer to the theory of classical BSDEs and get a 
unique solution in this interval. 

Step n. We continue this iteration until we have reached the interval [0, 7 — ^5], 
where ^ is a natural number such that 

7 - (/2-h 1)5 < 0 < 7 - ^5. 



Combining the solutions from each of the subintervals, we get a solution for the 
whole interval. □ 
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5 Applications 

In this section we illustrate the results of the previous sections by looking at two 
examples. 



5.1 Optimal Portfolio in an Insider Influenced Market 

In the seminal papers by Kyle [4] and Back [2] it is proved that in a financial market 
consisting of 

• noise traders (where noise is modeled by Brownian motion), 

• an insider who knows the value L of the price of the risky asset at the terminal 
time t = T and 

• a market maker who at any time t clears the market and sets the market price, 

the corresponding equilibrium price process (resulting from the insider’s portfolio 
which maximizes her expected profit), will be a Brownian bridge terminating at the 
value L at time t = T. In view of this we see that a predictive mean-field equation 
can be a natural model of the risky asset price in an insider influenced market. 

Accordingly, suppose we have a market with the following two investment pos- 
sibilities: 

• A risk free asset, with unit price S^it) := 1 for all t 

• A risky asset with unit price S(t) := Y (t) at time t, given by the predictive mean- 
field equation 



where /x(0 = /x(C co) is a given bounded adapted process and L is a given bounded 
-measurable random variable, being the terminal state of the process Y at time 

T. 

Let u{t) be a portfolio, representing the number of risky assets held at time t. We 
assume that G = F. If we assume that the portfolio is self-financing, the correspond- 
ing wealth process X(t) = X^(t) is given by 



dY(t) = -A(t)fi(t)dt -h Z{t)dB{t)\ t e [0, T) 
Y(t) =L(co); te[T,T + 8], 



(41) 



dX(t) = u(t)dY(t) = u(t)A(t)p.(t)dt + u(t)Z(t)dB(t); t e [0, T) 
X(0) = x >0. 



(42) 



Let!/ : [0, oo) [— oo, oo) be a given utility function, assumed to be increasing, 
concave and on (0, oo). We study the following portfolio optimization problem: 
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Problem 1 Find w* g ^ 2 / such that 

sup E[U{X^{T))] = E[U{X^\t))1 (43) 

This is a problem of the type discussed in the previous sections, with f = xf/ = 
N = 0, (p = U and h(x,co) = L(co), and we can apply the maximum principles 
from Sect. 3 to study it. 

By (6) the Hamiltonian gets the form 

H(t, X, y, a, z, k, u, p, q, r, X) = uapi(t)p + uzq + api(t)X. (44) 

The associated backward-forward system of equations in the adjoint processes 
p(t), q(t), X(t) becomes 

• BSDE in p(t),q{t)\ 



dp(t) = q(t)dB(t) ; 0 < t < T 
piT) =U\X{T)), 

• SDE in X{t)\ 

dX{t) = pi(t — 3)\u{t — 8)p(t — 5) + X(t — S)]x[8,T](t)dt 
-\-u(t)q(t)dB(t) ; 0 < t < T 
X(0) = 0. 

The Hamiltonian can only have a maximum with respect u if 

A(t)fji(t)p(t) + Z{t)q{t) = 0. 

Substituting this into (45) we get 

dp{t) = —6{t)p{t)dB(t)\ 0 <t <T 
p(T) =U\X{T)), 

where 

0{t) := 

Z(t) 

From this we get 

p(t) = cexp(— [ 0{s)dB{s) [ (0(s))^ds); 0 < t < T 

Jo 2 Jo 



(45) 



(46) 



(47) 



(48) 



(49) 



( 50 ) 
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where the constant 



C = p(0) = E[U'(X{T)] 



(51) 



remains to be determined. 

In particular, putting t = T in (50) we get 

U'(X{T)) = p{T) = cexp(- [ 9{s)dB{s) ( {e{s)fds) (52) 

70 ^ 70 

or 

nT Y pT 

Z(D = ([/')-* (c exp(- / 9{s)dB{s)-- {9{s)fds)). (53) 

70 ^ 70 



Define 



T{T) = exp( r e{s)dB(s) - ]- f {9 {s)pds). (54) 

70 ^ 70 

Then by the Girsanov theorem the measure Q defined on by 

dQ{cD) = T{T)dP{cD) (55) 

is an equivalent martingale measure for the market (41). Therefore, by (53), 

X = Eq{X{T)] = EW’rHcQxpi- r 9{s)dB{s)-\ C {9 {s)fds))T{T)]. 

70 ^ 70 

(56) 

This equation determines implicitly the value of the constant c and hence by (53) 
the optimal terminal wealth X{T) = {T). To find the corresponding optimal 

portfolio i/* we proceed as follows: 

Define 



Zo(0 :=u\t)Z{t). (57) 

Then (X“* (t), Zq( 0) is found by solving the linear BSDE 

dX“*{t) = Zoit)dB{t); 0<t <T 

X“*(D = £[([/')-* (cexp(-/o^ 9(s)dB(s) - (9(s)pdsW(T)]. 



We have proved: 
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Theorem 4 (Optimal portfolio in an insider influenced market) The optimal port- 
folio u"" for the problem (43) is given by 



u*(t) 



Zo(t) 
Z(t) ’ 



(59) 



where Zo(t), Z(t) are the solutions of the BSDEs (41), (58), respectively, and c and 
6 are given by (56) and (49), respectively. 



5.2 Predictive Recursive Utility Maximization 

Consider a cash flow X (t) = X^(t) given by 

dX{t) = X{t)[pi{t)dt + a{t)dB{t) 

+ y(t, ON(dt, JO] - c(t)X(t)dt; t e [0, T) (60) 

A(0) = x > 0. 

Here p.(t), a(t), y(t, are given bounded adapted processes, while u(t) := c(t) is 
our control, interpreted as our relative consumption rate from the cash flow. We say 
that c is admissible if c is F-adapted, c(t) > 0 and X^{t) > 0 for all r G [0, T). We 
put G = F. 

Let Y{t) = Y^{t), Z(t) = Z^(t), K(t, = K^{t, he the solution of the 
predictive mean-field BSDE defined by 

dY(t) = ~{a(t)A(t) +\n{c(t)X(t))}dt -h Z(t)dB(t) 

-h K{t, ON(dt, JO; t G [0, T) (61) 

Y(T) = 0, 



where a(t) > 0 is a given bounded F-adapted process. Then, inspired by classical 
definition of recursive utility in [3], we define 7*^(0) to be the predictive recursive 
utility of the relative consumption rate c. 

We now study the following predictive recursive utility maximization problem: 

Problem 2 Find c* g ^ such that 

sup 700) = 7^*(0). (62) 

We apply the maximum principle to study this problem. In this case we have 
f = (p = h = 0, \l/(x) = X, and the Hamiltonian becomes 

H(t, X, y, a, z, k, u, p, q, r, X) = x[(fi(t) — c)p (j{t)q -h / y{t, t;)r{t;)v{dt , JOl 

m 

-h [aa{t) -h Inc -h \nx]X. 



(63) 
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The associated backward-forward system of equations in the adjoint processes 
p(t), q(t), k(0 becomes 

• BSDE in p(t),q{t)\ 

dp{t) = -[(Mt) - cit))p(t) + cr{t)q{t) + fjg y(t, d^) + ^]dt 

+q{t)dB{t) + ^)N(dt,d^) ; 0<t<T 

P(T) =0, 

(64) 

• SDE in X(t): 



dX{t) = a(t — S)X(t — S)]x[s,T]{t)dt : 0 <t < T 

A(0) = 1. 



(65) 



The delay SDE (65) does not contain any unknown parameters, and it is easily seen 
that it has a unique continuous solution X{t) > 1, which we may consider known. 

We can now proceed along the same lines as in Sect. 5.2 of [1]: Maximizing H 
with respect to c gives the first order condition 



c(t) = 



m 

x{t)p{ty 



The solution of the linear BSDE (64) is given by 



( 66 ) 



k(^)E(^) _ 

T{t)p{t) = E{ / Yr ^ 

Jt ^(s) 



(67) 



where 

dV(t) = — c(t))dt + a(t)dB(t) + /jj y(t, ^)N{dt, df)] ; 0 < t < T 

r(0) = 1. 

( 68 ) 



Comparing with (60) we see that 

X(t) =xr(t) ; 0 < t < T. 
Substituting this into (67) we obtain 






p(t)X{t) = E[ X{s)ds\^,] ; 0<t <T. 



(69) 



(70) 
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Substituting this into (66) we get the following conclusion: 

Theorem 5 The optimal relative consumption rate c* (t)for the predictive recursive 
utility consumption problem (62) is given by 

C*(t) = ^32 ; 0<t <T, ( 71 ) 

E[j; X{s)ds\^,] 

where ^(t) is the solution of the delay SDE ( 65 ). 
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Modelling the Impact of Wind Power 
Production on Electricity Prices 
by Regime-Switching Levy Semistationary 
Processes 



Almut E.D. Veraart 



Abstract This paper studies the impact of wind power production on electricity 
prices in the European energy market. We propose a new modelling framework 
based on so-called regime-switching Levy semistationary processes to account for 
forward-looking information consisting of predicted wind power generation. We 
show that our new regime- switching model, where the regime switch depends on the 
so-called wind penetration index, can describe recent electricity price data well. 



1 Introduction 

Renewable sources of energy are of increasing importance in modern energy mar- 
kets. For instance, the European Union has set the target of increasing the share of 
energy from renewable sources by 2020 to 20 %. In the German and Austrian energy 
market, which will be the focus of this paper, the most important source of renew- 
able energy is wind, followed by biogas and solar. Since many renewable sources are 
highly dependent on weather conditions, they tend to increase the volatility of the 
corresponding energy prices. It is hence urgent and important to find reliable models 
which can describe electricity prices in these changing market conditions which can 
be used for risk assessment and management in energy markets. 

The recent literature has presented a variety of both discrete-time and continuous- 
time time series models which promise to describe the stylised facts of energy 
markets, see e.g. [3, 17] for reviews. However, reliable models which incorporate 
information on renewable sources have only recently emerged and have currently 
been restricted to discrete-time models, see e.g. [9-12, 18]. 

This paper contributes to the continuous-time literature by introducing for the first 
time a modelling framework which takes the forward-looking information avail- 
able to market participants through wind production forecasts into account when 
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modelling electricity day-ahead prices. In doing so, it extends recent work by [1] 
who proposed to model electricity spot prices by so-called Levy semistationary 
processes. Their model consists of a reduced form approach, which only considers 
electricity prices directly and does not take any price information from other fuels 
or commodities into account. Note that other types of forward-looking information, 
such as capacity constraints, have been incorporated in models for electricity prices 
by [8]. Also, [2] have developed a framework for incorporating forward-looking 
information in electricity or weather markets through an enlargement of filtrations 
approach. 

With the increasing power generation through wind farms, we have observed 
that electricity prices at the European Energy Exchange (EEX) started to become 
negative, which happened for the first time in October 2008, and partially even 
exhibited rather extreme negative price spikes, see e.g. [ 1 6] and the references therein. 
Also, various articles have argued that increasing wind power production seems to 
decrease the overall price level, but tends to increase the observed volatility in the 
market, see e.g. [12]. These are important findings, which need to be incorporated 
into a modelling framework, one of which will be presented in this paper. 

The outline for the remaining part of this article is as follows. In Sect. 2 we give 
a detailed description of the data from the EEX which will be used in our empirical 
analysis and we carry out an exploratory data analysis to motivate the new model we 
are going to introduce in this paper. Section 3 contains the main contribution, where 
we introduce the new class of regime- switching Levy semistationary processes and 
show how they can be calibrated to our empirical data. Einally, Sect. 4 concludes. 



2 Exploratory Data Analysis 

This section presents the results of an exploratory data analysis of electricity price 
and wind production data from the European Energy Exchange, which motivates the 
new modelling framework which we will introduce in Sect. 3. 



2.1 Description of the Data 

Our empirical data analysis focuses on electricity prices and wind data for a time 
period from 01.01.2011 to 31.07.2014, i.e. consisting of 1308 days. 

We consider three sets of data: electricity prices, their corresponding volumes 
(sometimes called loads) and wind production data. More precisely, daily EEX 
Phelix baseload and EEX Phelix baseload volume data (for Germany and Austria) 
were downloaded from Datastream and the EPEX spot website. In addition, we 
downloaded iht forecasted wind production data for the four German Transmission 
System Operator (TSOs) (50 Hz Transmission, Amprion, Tennet TSO, EnBW 
Transportnetze (Transnet) ) and one Austrian TSO (Verbund (APG)). These data 
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Table 1 Summary statistics of the EEX Phelix baseload from 01 .01 .201 1 to 3 1 .07.2014 



Minimum 


1st Quartile 


Median 


Mean 


3rd Quartile 


Maximum 


-56.87 


33.87 


43.10 


41.96 


50.69 


98.98 



have been aggregated to obtain daily forecasts for the wind production for each of 
the five TSOs. 



2.2 EEX Phelix Baseload Prices 



First of all, we want to explore the specific properties of the time series of the 
electricity price data. We focus on the day-ahead electricity prices determined by a 
daily auction at 12:00 pm, 7 days a week all year (including statutory holidays). The 
underlying quantity to be traded is the electricity for delivery the following day in 
24 h intervals. The prices are bounded (currently between [-500, 3000] EUR/MWh). 
The EEX Phelix baseload is obtained as the daily averages of the 24 h day-ahead 
prices for Germany and Austria. 

From the summary statistics in Table 1, we notice that the times series does not 
feature any truly extremes spikes, which have occurred in older data sets from the 
EEX market. In addition, we notice that there are negative prices even in the baseload 
prices which consists of the average of the 24 h prices, see also Fig. 1. The plot of 
the autocorrelation function reveals a clear weekly pattern which is one of the well- 
known stylised facts of such data. 

When we study the distributional properties of the price data, see Fig. 2, we clearly 
observe that the empirical distribution is not well described by a Gaussian distribu- 
tion, but appears to be asymmetric and features heavier tails, the latter is particularly 
pronounced when we focus on the left tail of the distribution. 




Fig. 1 Time series plot of the baseload prices (in EUR/MWh) and autocorrelation plot 
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Fig. 2 Distributional properties of the prices: The standardised histogram of the empirical distrib- 
ution and estimated kernel density function of the prices are depicted in the first plot. The second 
plot compares the empirical distribution to a Gaussian distribution via a quantile-quantile plot 



2.3 Predicted Wind Energy Feed-In 

Next we investigate the data from the five Transmission System Operator (TSOs) in 
Germany and Austria. Note that we are studying the one-day ahead predicted wind 
feed-in since we assume that this is the quantity which impacts the one-day ahead 
electricity prices determined in the daily auction. 

Note that for each TSO, the data is available in 15 min intervals, where the unit of 
measurement is Megawatt (MW). In order to get the hourly forecasts, we aggregated 
the data as follows. Let denote the 15 min wind power forecast for quarter 

i within hour t. We then obtain the hourly forecasts (recorded in Megawatt hours 
(MWh)), denoted by where t(j) denotes the jth hour on the tth day, from 

i = l 

Further, we obtain the daily forecasts, denoted by from 

24 

y/^^[MW • h] = ^ ^/o')[MW . h]. 

;=i 

The data are summarised by TSO in Table 2. 

In the following study, we focus on the aggregated time series which accumulates 
the predicted wind feed-in of all five TSOs, see Table 3 for the summary statistics 
and Fig. 3 for the time series plot. We clearly observe a yearly cycle in the wind data 
showing that the predicted wind feed-in is always highest during the winter time 
period. 
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Table 2 Forecasted wind feed-in in Gigawatt hours (GWh) (rounded) 





50 Hz 


Amprion 


APG 


Tennet 


Transnet 


Total 


2011 


19490 


6687 


18 


18484 


416 


45095 


2012 


20203 


7253 


25 


20464 


246 


48191 


2013 


19129 


7742 


31 


21259 


425 


48585 


2014 (Until July) 


11648 


4799 


21 


13412 


399 


30279 



Table 3 Summary statistics of the forecasted aggregated wind production data (in GWh) from 
01.01.2011 to 31.07.2014 



Minimum 


1st Quartile 


Median 


Mean 


3rd Quartile 


Maximum 


12510 


53870 


98940 


131600 


176000 


572600 




Fig. 3 Time series of the forecasted aggregated wind production data from 01.01.2011 to 
31.07.2014 reported in GWh 



2 A Wind Penetration Index 



Jonsson et al. [10] pointed out that there is a non-linear and time dependent relation- 
ship between wind power forecasts and spot prices. Moreover, they found that “it is 
in fact the ratio between the forecasted wind power generation and the forecasted 
load that has the strongest association with the spot prices”, see [10, p. 314]. Hence, 
in the following analysis, we will not use the predicted wind feed-in data directly, but 
rather focus on the so-called wind penetration index, which describes the percentage 
of the wind feed-in compared to the total energy production. 
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Table 4 Summary statistics of the wind penetration index from 01.01.201 1 to 31.07.2014 



Minimum 


1st Quartile 


Median 


Mean 


3rd Quartile 


Maximum 


0.02104 


0.08796 


0.15480 


0.19050 


0.26450 


0.67540 



In order to compute this index, we follow the approach outlined in [10]: They 
argued that according to recent work by [15], state-of-the-art forecasting models 
lead to load forecasts where the predicted load equals the actual load plus an error 
term, i.e. let AL^ denote the actual load on day t and let Lt denote the corresponding 
predicted load. Then 



ALt = Lt €t, where A^(0, a^). 



Typically, the standard deviation a is chosen as 2 % of the average load for the period 
considered. 

Following this methodology, we downloaded the actual load data from Datastream 
and the EPEX website and computed the predicted load data by adding Gaussian 
perturbations to the actual load data. Clearly, this is not exactly the same as working 
with the predicted load information from each TSO directly, see [10] for a discussion, 
but the practical impact of this approximation has been found to be marginal. 

We can now define the so-called wind penetration index on day t as 



WPt := 




This is in fact the prediction of the wind penetration on day t, which is available on 
day t — \ and can hence be considered as forward-looking information. We provide 
the summary information of the wind penetration index for our sample in Table 4. 

Moreover, a time series plot and the corresponding histogram of the wind pene- 
tration index is depicted in Fig. 4. We observe that the time series plot of the wind 
penetration index resembles the one for the original wind data — including a yearly 
seasonal pattern. Also, the wind penetration index is overall rather low, which is 
indicated by the histogram and the quantile information contained in Table 4. This 
is not surprising since the conventional fuels still account for the majority of the 
electricity production in the European energy market. 



2.5 The Relation Between Prices and Wind Data 

Finally, we carry out an exploratory study of the relation between the electricity 
prices and the wind penetration index. In Fig. 5, we plot the electricity prices versus 
the wind penetration index to check whether we can spot any association between the 
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Fig. 4 Time series plot and histogram of the wind penetration index 




Prices 



Fig. 5 The electricity prices are plotted versus the wind penetration index. The five horizontal 
lines correspond to the minimum, the 25 %, 50%, 75 % quantiles and the maximum of the wind 
penetration index, respectively 



two variables. We observe that the two lowest electricity prices are associated with 
a rather high wind penetration index. Also, for a very high wind penetration index, 
the prices seem to be below their mean value. This is in line with earlier studies 
which found that a high wind production typically results in lower electricity prices. 
However, we need to keep in mind that by comparing the wind and the electricity 
prices, we can only obtain a partial picture, since clearly other fuels, such as coal, 
gas and nuclear, play a key role in determining the corresponding electricity price 
and are for the purpose of this study excluded from the analysis. 
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Fig. 6 Distribution of the electricity prices for different quartiles of the wind penetration index. 
E.g. the first plot corresponds to a low wind penetration index (the first quartile) and the last one to 
a rather high wind penetration index (the fourth quartile) 



We also compare the distribution of the electricity prices associated with different 
quartiles of the wind penetration index. That is, we have divided our price data into 
four groups corresponding to the 1st, 2nd, 3rd and 4th quartile of the wind penetration 
index. When comparing the corresponding marginal distributions, we observe again 
that smaller price data are associated with a higher wind penetration index, see Fig. 6. 

The finding from this exploratory study motivates the new modelling framework 
which we are going to introduce in the next section. 



3 Model Building 

Recent work by [1] suggests that the class of so-called Levy semistationary (LSS) 
processes is very suitable for modelling electricity day-ahead prices. In their work, the 
class of LSS processes was used in a truly reduced form modelling set-up, meaning 
that the (deseasonalised) electricity prices were modelled directly by LSS processes 
and no other external variables were included in the analysis. 

Here we will go one step further and explore the possibility of including / 6 >nv( 2 rJ- 
looking information in form of the wind penetration index into a new modelling 
framework which is based on LSS processes. 

In a first step, we are going to review the basic traits of LSS processes and then we 
will discuss how an LSS-based model can be extended to account for forward-looking 
information. 
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An LSS process Y = {Y (0}«eR on R without drift is defined as 

r(t)= / g{t - s)(j{s-)dL(s), (1) 

J —oo 

where L denotes a two-sided Levy process, g : R ^ R denotes a determinis- 
tic weight function satisfying g(s) = 0 whenever s < 0 and a denotes a cadlag, 
adapted stochastic volatility process, which is assumed to be independent of L. In 
order to ensure the existence of the stochastic integral, we need suitable integrability 
conditions on the kernel function g, see [1] for details. 

Note that the name Levy semistationary process indicates that the process Y is 
stationary as soon as the stochastic volatility process cr is a stationary process. The 
reason for choosing a stationary process for modelling deseasonalised electricity 
prices is that commodity prices typically exhibit strong mean reversion. A stationary 
process can in fact mimic such a behaviour since it ensures that the process cannot 
move away from its long term mean indefinitely, but will need to return to it since 
otherwise the stationarity would not be preserved. 

Many well-known stochastic processes belong to the LSS class, including volatil- 
ity modulated Omstein-Uhlenbeck processes, continuous-time autoregressive mov- 
ing average (CARMA) processes and fractionally integrated CARMA processes. 

It is important to note that LSS processes are in general not semimartingales, 
which has been discussed in detail in [1]. However, in this paper, we will in fact be 
staying within the traditional semimartingale framework since we will be working 
with volatility modulated CARMA processes as the main building blocks for our 
new model. To this end, let us briefly recall their definition. 

Suppose we have nonnegative integers p > q and we wish to define a CARMA 
(p, q) process. We introduce the autoregressive (AR) and moving average (MA) 
polynomials: 



P^^(p)(z) = zP +aizP-^ + --- + ap, 

+ ^12 H 1- bp-iZ^~^, 

where = Imdbj =0forq < j < p. Moreover, we assume that the polynomials 
have no common roots and then write formally 

/’AR(P)(£))y(f) = 



where D = We can make sense of the “derivative” of the Levy process through 
a state space representation, where we write 



Y{t) = b"^V(?) , for d\{t) = NW{t)dt + edL(t), where 
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Assuming that all eigenvalues of A have negative real parts, we know that 



V(r) = r e 

J —OO 



Mt-s) 



edL(s) 



is the (strictly) stationary solution of the stochastic differential equation above, see 
[5]. That is, in our LSS specification we can choose g(x) = and cr = 1 to 

obtain a CARMA(p, q) process. As soon as a stochastic volatility process is added, 
we would call the corresponding LSS process a volatility modulated CARMA process. 



3.1 Deseasonalising the Data 

We argued before that stationary processes can easily accommodate key stylised 
facts of commodity prices. However, at the same time, we cannot ignore that strong 
seasonal effects are typically present in such markets and need to be accounted for. 
We proceed by introducing an arithmetic model for the electricity day-ahead price, 
denoted by S = (S(t))t>o, where 

S(t) = A(t) + Y(t). 

Here, A denotes a deterministic seasonality and trend function and Y denotes a sto- 
chastic process. In the original framework proposed by [1], the process Y was chosen 
to be an LSS process. In the following, however, we will introduce a modification of 
that modelling framework. 

The seasonality and trend function is chosen to be 

/ ri -h lirt \ ( T2~\- 27Tt 

A(t) = co + cit -h C 2 cos I — I + <^-3 cos I 

which accounts for a linear trend and weekly and yearly seasonal cycles. We used a 
robust least squares estimation procedure to estimate the parameters (by truncating 
the spikes in the estimation procedure) and obtained the following estimates and 
standard errors, see Table 5, all of which were highly significant. The estimated 
parameters in the seasonality function confirm the existence of both weekly and 
yearly seasonality as well as the presence of a negative trend. 
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Table 5 Estimated parameters in the seasonality and trend function A 





CO 


Cl 


C2 


C3 


r\ 


T2 


Estimate 


53.47 


-0.01724 


2.377 


-6.815 


349.25 


33.59 


Standard error 


0.4693 


0.0006203 


0.3356 


0.3302 


50.46 


0.3394 



o 

o 




Days 

Fig. 7 Electricity day-ahead data and fitted seasonality and trend function 



Figure? shows a graph of the original data with the fitted seasonality function 
super-imposed. Also, Fig. 8 depicts the deseasonalised price data and their autocor- 
relation function. 

It should be noted that a variety of alternative procedures could be followed to deal 
with the problem of seasonality in the electricity prices. Here we are dealing with a 
rather simple deterministic parametric function to mimic the trend and the seasonal 
cycles. Using e.g. weekly and yearly dummy variables could refine this approach even 
further, but would result in a less parsimonious model. A more interesting alternative 
to the approach pursued here is to acknowledge the fact that the seasonality cannot 
only be determined by historical data, but also through other market data available to 
market participants. Example, it has been observed that gas and coal prices, given that 
they are important fuels used to produce electricity, play a key role in determining 
trend and also seasonal cycles of electricity prices. This suggests that e.g. forward 
curve data for gas and/or coal could be used to model the trend, see e.g. [8] for 
research along those lines. Also [14] give a detailed account on robust estimation 
procedures of the long-term seasonal component of electricity prices. 
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Fig. 8 Time series and autocorrelation plot of the deseasonalised electricity day-ahead data 



Table 6 Estimated parameters of the associated ARMA(2,1) process 





ARl 


AR2 


MA 


Estimate 


1.1480 


-0.2324 


-0.6962 


Standard error 


0.0597 


0.0447 


0.0501 



Here ARl and AR2 corresponds to the first and second autoregressive parameter, respectively, and 
MA corresponds to the moving average parameter 



3.2 Fitting a CARMA Process 

After the seasonality has been removed, we need to find a suitable model for the 
stochastic process Y. Following the success of the CARMA processes in describing 
electricity prices, we choose a Levy semistationary process where the kernel function 
is given by a kernel associated with a CARMA(p,q) process. More specifically we 
choose p = 2 and q = 1. Note that when choosing the order of the CARMA(p,^) 
process, we need to consider pairs (p,q) such that p > q so that the CARMA 
process is well defined. We choose a CARMA(2,1) process due to reasons of analyt- 
ical tractability and increased flexibility compared to a simple Ornstein-Uhlenbeck 
model. In our goodness-of-fit study, we indeed find that such a model choice is 
suitable here. 

Note that [6] have discussed in detail how a discretely sampled CARMA process 
can be represented as a weak ARM A process. Using this representation we have first 
estimated the corresponding ARMA(2, 1 ) parameters by a quasi-maximum likelihood 
method. The corresponding parameters estimates and standard errors are provided 
in Table 6. 
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Fig. 9 Empirical {bars) and estimated {solid line) autocorrelation function of the estimated 
CARMA(2,1) process 

Following the procedure outlined in [6] we can then recover the corresponding 
continuous-time parameters. In our case, we have a\ = 1.459, a 2 = 0.162, Z?o = 
0.383. Note that one can easily verify that the estimates satisfy the condition that 
the eigenvalues of A have negative real parts, which implies a stationary model. We 
compare the empirical and estimated autocorrelation function in Fig. 9, where we 
observe a good fit. 

Under the assumption that the CARMA process is driven by a subordinator, [6] 
have shown how the corresponding increments of the driving Levy process can be 
recovered from a discretely observed CARMA process, see also [7] for the multivari- 
ate case. Here we have implemented their algorithm for the case of a CARMA(2,1) 
process and have recovered the driving process. Note that the original algorithm 
was designed for driving Levy processes, but can in fact be adapted to the case of 
volatility modulated Levy processes as well, the case which will be relevant in the 
next section. 

Let us briefly recap our estimation procedure until now: We started off with a spot 
price model S{t) = A{t) -\- Y {t), where we have estimated the seasonality function 
A and have removed it from the data. In the next step, we have assumed that Y is an 
LSS process of the form 




— 00 



g{t — s)dM{s), 
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where g corresponds to the kernel function associated with a CARMA(2,1) process 
and initially M was assumed to be a Levy process, which can be recovered from the 
observations of the CARMA process. We will now leave this traditional framework 
behind and will introduce a new regime- switching model based on LSS processes. 



3.3 The New Model Based on a Regime-Switching LSS 
Process 



The predicted wind penetration index can be viewed as forward-looking information 
since the information is available before the prices for the next day are determined 
in the auction market. Hence it is reasonable to try to incorporate this information in 
the model. 

Previous studies have included such information in discrete-time models such as 
e.g. ARMAX-GARCHX models, see [12], where the wind is treated as an exoge- 
nous variable. However, we are interested in a continuous-time modelling framework. 
E.g. one could consider CARMA-X models or regime- switching models. Here we 
will follow the latter approach which is motivated by the work by [8] who incorpo- 
rated forward-looking capacity constraints into a jump-diffusion model for electricity 
prices. 

We introduce an exogenous regime- switching variable based on the forward- 
looking variable given by the predicted wind penetration index p, where 



pis) 



1 , if the predicted wind penetration at time 5' is “high” 

0, if the predicted wind penetration at time 5' is “low”. 



( 2 ) 



The new spot price model is then given by S{t) = A{t) -\-Y (0, where 



Y{t) 



-f 

J — cx 



g{t — s)dM{s), t >Q. 



Here 



dM{s) = p{s)dM^^\s) + (1 - p(s))dM^^\s), 



where 



dM^^\s) = a^^\s)ds -h (j^^\s-)dL^^\s), 



for independent Levy processes and Also, denote suitable drift and 
stochastic volatility processes, for / g {1, 2}. 

The key question which remains to be addressed is how exactly the regime- 
switching variable p should be chosen, given that the expression in (2) appears 
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Fig. 10 Standardised histograms of the increments of M for different levels (associated with the 
four quartiles) of the wind penetration index 



rather casual. In order to answer this question, we study the empirical properties of 
the recovered driving process M. More precisely, we investigate how the marginal 
distribution of the driving process of the CARMA process changes in relation to 
different levels of the wind penetration index. We split the sample of the recovered 
increments of M into four parts corresponding to the four quartiles of the wind 
penetration index, which are given in Tabled. 

Their empirical distributions are described in form of standardised histograms, 
which describe the empirical probability density functions, in Fig. 10. Similarly to 
the finding in our exploratory data analysis, we observe that also the distribution 
of the increments of the driving process M changes quite remarkably for different 
levels of the wind penetration index. In particular, we observe that rather extreme 
negative increments are associated with a relatively high wind penetration index. 

One can imagine a variety of rather sophisticated methods for choosing the cut-off 
point for our regime-switching variable. Here we are interested in a rather simple 
rule, which at the same time allows for a reasonable amount of observations in the 
high regime so that inference is still feasible and does not just rely on a very small 
number of observations. Hence, we choose the cut-off point to be 26.4 % as a hard 
threshold, meaning that all increments of M associated with a wind penetration index 
in the fourth quartile belong to the high regime. 
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3.4 Model for M Based on the Generalised Hyperbolic 
Distribution 



Motivated by the empirical study in [1], we will fit the class of generalised hyperbolic 
(GH) distributions to the increments of M in the two regimes. 

Our notation for the GH distribution follows the one used in [13]. See also [4] for 
more details on the implementation of the corresponding estimation procedures in R 
available through the ghyp package. 

Let us denote by d,k g N some constants and let X denote a ^-dimensional 
random vector. Recall that we say that the law of X is given by the multivariate 
generalised hyperbolic (GH) distribution if 

X '= ^ + W 7 + ^/wcz, 



where Z ~ N(0, Ik), C e e Here W > 0 denotes a one-dimensional 

random variable, independent of Z and with Generalised Inverse Gaussian (GIG) 
distribution, i.e. W ^ G/G(A, x, t/^). The density of the GIG distribution with 
parameters (A, x, V^) has the following functional form: 



/gigM = ( — 






^A-l 



2Kx(Vj^) 



exp 



(-5(1+'*')) 



where K\ denotes the modified Bessel function of the third kind, and the parameters 
have to satisfy one of the following three restrictions 



X > 0, > 0, A < 0, or X > > 0, A = 0, or x ^ 0, 7 /^ > 0, A > 0. 

The parameter p, is called the location parameter, E = CC^ is the dispersion matrix 
and 7 is the symmetry or skewness parameter. The three (scalar) parameters A, x, of 
the GIG distribution determine the shape of the GH distribution. The parametrisation 
described above is referred to as the so-called (X, x, ^ , 7 )-parametrisation of 

the GH distribution. Since this parametrisation causes an identifi ability problem when 
one tries to estimate the parameters, we will work with the so-called (A, a, p, Z, 7 )- 
parametrisation in our empirical study. One can show that the (A, x, 7)- 

parametrisation can be obtained from the (A, a, /i, iJ, 7 )-parametrisation by setting 

_/^A+l(a) _ K\(a) 

Kx(a) ^ ^ i; ^A+i(^) 

and A, U, 7 remain the same, cf. [4]. 

We estimated 1 1 distributions within the GH class — consisting of the asymmetric 
and symmetric versions of the GH, hyperbolic. Student’s t, variance gamma, nor- 
mal inverse Gaussian distribution and the Gaussian distribution and compared them 
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Table 7 Parameter estimates for the Student’s t-distribution in the low regime (symmetric case) 
and high regime (asymmetric case) 





V 




a 


7 


Low regime 


6.70 


2.26 


9.77 


0 


High regime 


4.76 


0.57 


11.31 


-7.68 



Note that the parameter a = 0 in the case of the Student’s t-distribution 





Theoretical Quantiles 



Theoretical Quantiles 



Fig. 1 1 Diagnostic quantile-quantile plots : The first picture compares the empirical quantiles of the 
data in the low regime with the estimated symmetric Student’s t-quantiles, and the second picture 
compares the empirical quantiles of the data in the high regime with the estimated asymmetric 
Student’s t-quantiles 



according to the Akaike information criterion. We found that the best model for 
the low regime is given by the symmetric Student’s t-distribution and for the high 
regime by the asymmetric Student’s t-distribution, see Table? for the corresponding 
parameter estimates. 

When comparing the parameter estimates for the low and the high regime provided 
in Table 7, we observe that the skewness, fatness of the tails and the volatility increases 
for the high regime and that the mean parameter decreases compared with the low 
regime. This is in line with previous findings in the literature, which suggest that 
the price level typically decreases with increasing wind energy production and that 
the volatility and the risk for negative spikes (represented through negative skewness 
and fatter tails) typically increases. 

Also, we have provided quantile-quantile plots to assess the goodness-of-fit of the 
Student’s t-distribution in Fig. 11, which overall look reasonable. 

The estimation results suggest that a good model for the driving process M in the 
regime- switching LSS specification is given by 



338 



A.E.D. Veraart 



dM{s) = p{s)dM^^\s) + (1 - p(s))dM^^\s), 

where 

dM^-^^s) = + 7 ds + 

dM^^\s) = + cr®(sMW®(s), 



for independent Brownian motions and Here the stochastic volatility 

processes are chosen as Ornstein-Uhlenbeck processes with inverse Gamma 
marginal distribution, since a mean- variance mixture with the inverse Gamma dis- 
tribution results in the Student’s t-distribution. 

Note that the reason for choosing volatility modulated Brownian motions rather 
than Levy processes with Student’s t-distribution is that we found a significant short 
term (2 lags) autocorrelation in the increments of the recovered process M suggesting 
that a stochastic volatility model is more suitable than a pure jump model. This finding 
reveals that stochastic volatility is a key feature in energy markets, but it typically 
only exhibits short memory. Stochastic volatility is naturally incorporated into the 
LSS framework making it a convincing modelling tool for energy markets. 



4 Conclusion 

This paper has presented an extension of the modelling framework based on Levy 
semistationary (LSS) processes introduced by [1]. Since forward-looking informa- 
tion in terms of weather forecasts is available to market participants, the corre- 
sponding predictions for the day-ahead wind production can be derived and used 
when determining day-ahead electricity spot prices. We incorporated this informa- 
tion through the so-called predicted wind penetration index in a regime- switching 
model based on LSS processes. We have observed that the flexibility offered through 
the regime switching component allows to model electricity prices in a more refined 
way than it was possible in the original (reduced form) LSS modelling framework. 
In particular, we have found that a relatively high wind penetration index leads to a 
lower mean level, higher skewness, fatter tails and increased volatility in the distribu- 
tion of the electricity prices. This confirms earlier findings in the literature and for the 
first time links them to a flexible continuous-time stochastic modelling framework. 

Given the increasing importance of renewable sources of energy, it will be inter- 
esting to extend the current investigation to include a wider range of renewables, 
including solar and biogas which up to now do not play as big of a role as wind 
power generation in the European energy market. 

Another area for future research would be to develop a stochastic model for the 
wind penetration index, which could either result in a regime-switching model with 
a stochastic switching parameter or in a joint model for electricity prices and the 
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wind generation index. A preliminary analysis along those lines has revealed that 
a reasonable model for p needs to take both yearly seasonality and clusters into 
account. This could be seen as a first step for constructing models which can be used 
for mid-term forecasts of electricity prices influenced by renewables and would help 
to find modelling and inference tools for reliable risk management in energy markets. 
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Pricing Options on EU ETS Certificates 
with a Time- Varying Market Price of Risk 
Model 



Ya Wen and Rudiger Kiesel 



Abstract To price options on emission certificates reduced-form models have 
proved to be useful. We empirically analyse the performance of the model proposed 
in Carmona and Hinz [2] and Hinz [8]. As we find evidence for a time-varying mar- 
ket price of risk, we extend the Carmona-Hinz framework by introducing a bivariate 
pricing model. We show that the extended model is able to extract information on 
the market price of risk and evaluate its impact on the EUA options. 

Keywords Carbon market • EUA futures • Risk-neutral valuation • Market price of 
risk • Option pricing 



1 Introduction 

The European Union Emissions Trading Scheme (EU ETS) was launched in 2005 
and constitutes still the world’s largest carbon market. The EU ETS was set up 
as a cap-and- trade scheme and split up into three phases, namely Phase I (2005- 
07) without the possibility to bank unused permits; Phase II (2008-12) in which 
banking was allowed; and the current Phase III (2013-20) which, compared to the 
two previous periods, introduced significant changes, such as the abolishment of 
national allocation plans and the auctioning of permits. 

Besides permits, futures and options on permits are being traded. Various authors 
have discussed the design of the market and the pricing of the permits and the 
derivatives traded. The fundamental concepts for emission trading and the market 
mechanism have been reviewed in the paper of Taschini [15], which also provides 
a literature overview. Equilibrium models for allowance permit markets have been 
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widely used to capture the theoretical properties of emission trading schemes. Exam- 
ples are the dynamic but deterministic model proposed by Rubin [13] and stochastic 
equilibrium approaches such as Seifert et al. [14], Wagner [16] and Carmona et al. 
[3]. These models use optimal stochastic control to investigate the dynamic emission 
trading in the risk-neutral framework. Carmona et al. [4] derive the permit price for- 
mula which can be described as the discounted penalty multiplied by the probability 
of the excess demand event. Its historical model fit has been evaluated by Griill and 
Taschini [6]. Griill and Kiesel [5] used the formula to analyse the emission permit 
price drop during the first compliance period. Carmona and Hinz [2] and Hinz [8] 
propose a reduced-form model which is particular feasible for the calibration of EUA 
futures and options as it directly models the underlying price process. Both Paolella 
and Taschini [12] and Benz and Truck [1] provide an econometric analysis for the 
short-term spot price behavior and the heteroscedastic dynamics of the price returns. 
Eor the option pricing, Carmona and Hinz [2] derive a option pricing formula from 
their reduced-form model for a single trading period. They also discuss the extension 
of the formula to two trading periods. Hitzemann and Uhrig-Homburg [9, 10] develop 
an option pricing model for multi-compliance periods by considering a remaining 
value component in the pricing formula capturing the expected value after a finite 
number of trading periods. 

As emission certificates are traded assets their price paths carry information on 
the market participant expectations on the development of the fundamental price 
drivers of the certificates including the regulatory framework. In particular, prices 
of futures and options of certificates carry forward-looking information which can 
be extracted by using appropriate valuation models. In this paper we derive such 
a model by extending the reduced-form pricing model of Carmona and Hinz [2]. 
Using an extensive data set we extract a time series for the implied market price of 
risk, which relates to the risk premium the investors attach to the certificates. This 
requires a calibration of the model to historical price data during varying time periods 
and with different maturities of futures and options. A crucial step in the calibration 
procedure is a price transformation of normalized futures prices of permits from a 
pricing measure to the historical measure. We find that the implied market price of risk 
possesses stochastic characteristics. Therefore, we extend the existing reduced-form 
model by modeling the dynamics of the market price of risk as an Omstein-Uhlenbeck 
process and show that the extended model captures the appropriate properties of the 
market. The market price of risk is an implied value related to the permit prices, this 
requires an extension of the univariate permit pricing model to a bivariate one. In 
this context, the standard Kalman filter algorithm is considered to be an effective 
way to calibrate to the historical prices. We apply this methodology and estimate the 
implied risk premia. Once the risk premia have been determined, EUA option prices 
can be derived to fit the bivariate model setting, which helps to improve the accuracy 
of the pricing. 

This paper is organized as follows. In Sect. 2 we introduce the basic reduced-form 
model based on a risk-neutral framework. We calibrate the model with an extended 
data series and compare the calibration results. In Sect. 3 an extended bivariate pricing 
model will be introduced in order to capture the market information of the risk premia. 
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We demonstrate how to calibrate the extended model by applying the standard 
Kalman filter algorithm, we present the estimation results of this procedure and 
discuss the model fit. In Sect. 4 we evaluate the option pricing performance by taking 
into account the calibration results of the bivariate model. Sections concludes. 



2 Univariate EUA Pricing Model and Parameter Estimation 

The basic reduced-form model based on a risk-neutral framework was introduced 
by Carmona and Hinz [2] . Here, the aggregated normalised emissions are modelled 
directly and it can be shown that the emission certificate futures process solves a 
stochastic differential equation. In this section we give a brief introduction to the 
model and discuss the quality of the model calibration. 



2.1 Univariate Model 

We consider an emission trading scheme with a single trading phase with horizon 
[0, r]. The price evolution of emission permits is assumed to be adapted stochastic 
processes on a filtered probability space (^2, (^r)re[0,r]. with an equivalent 

risk-neutral measure Q ~ P. Based on the assumption of a market compliance 
at time T the price has only two possible outcomes, namely zero or the penalty 
level. The argument is as follows. If there are sufficient emission allowances in 
the market to cover the total emissions at compliance time, surplus allowances will 
become worthless. Otherwise, for undersupplied permits the price will increase to 
the penalty level. 

We introduce the reduced- form model of Carmona and Hinz [2]. Here the nor- 
malized futures price process is a martingale under Q given by 

a, = E'0[l{r,>i)|=^r], re [0,7]. (1) 

The process (/l)^e[0,r] denotes the aggregated normalized emission, and is assumed 
to follow a lognormal process given by 

Ft = 

where Gf stands for the volatility of the emission pollution rate, r ^ is a determi- 
nistic function which is continuous and square-integrable. (Wt)te[0,T] is a Brownian 
motion with respect to Q. Carmona and Hinz [2] prove that at solves the stochastic 
differential equation 

dat = 0' (0-Hat)) 



( 2 ) 
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with the function t zt, t & (0,T), given by 

zr = • C 

In order to calibrate the model, Carmona and Hinz [2] suggest to use the function 



To estimate the parameters one has to determine the distribution of the price variable. 
For this purpose one considers the price transformation process defined by at = 
where (P denotes the cumulative distribution function of the standard normal 
distribution. By applying Ito’s formula one has 



where Wt denotes the Brownian motion under the objective measure P and h is the 
market price of risk which is assumed to be constant. Moreover, it can be shown that 
is conditional Gaussian with mean /x^ and variance af so that the log-likelihood 
can be calculated and Maximum-likelihood estimation can be applied to find the 
model parameters. 



2.2 Estimation 

We calibrate the model to different emission trading periods during the first and 
second EU ETS trading phases. We consider the daily prices of the EUA futures 
with maturities in December from 2005 to 2012. Their historical price series are 
shown in Fig. 1. 

The price transformation is conditional Gaussian with its mean [it and vari- 
ance We consider the daily historical observations of the EUA futures at time 
t\, t 2 , . . . , tn. Their corresponding price transformations can be determined using the 
definition at = ^(^t)- Thus the parameters a, P,h can be estimated by maximizing 
the log-likelihood function given by 



Zt = ^{T-t)-\ 



with ckf G M and P G (0, oo), so one has 



dat = 0'{0{at))y/^{T - t)-»dW,. 



( 4 ) 




( 5 ) 







2aha, P) 
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Fig. 1 Historical prices of the EUA futures with maturities in December from 2005 to 2012 



Under the model assumptions the residuals 



must be a series of independent standard normal random variables. So standard 
statistical analysis can be applied to test the quality of the model fit. We show our 
estimation results in Table 1. The horizons of the price data are two years, starting 
from the first trading day in January of the previous trading year to the last trading 
day in December of the next year. 

Comparing the estimation values in Table 1 , the instability of the parameter va- 
lues in each cell for different time periods can be observed. Note the value for the 
market price of risk changes its sign during the first and second trading phase. This 
implies the inappropriateness of the assumption for a constant market price of risk. 
The fourth value in each cell is the negative of LLF. Note the -LLF are much lower 
after the first trading phase because the price collapse during 2006 to 2007 affects 
the data partially. 

From Figs. 2, 3, 4, 5 and 6 we display the time series of the residuals w/, their empir- 
ical auto-correlations, empirical partial auto-correlations and quantile-quantile-plots. 
We choose EUA futures with maturity in December 2007 (EUA 07) and EUA futures 
with maturity in December 2012 (EUA 12) as examples. 

The time series Wf show an effect of volatility clustering. This is confirmed by 
significant values to high lags in the sample autocorrelation and sample partial auto- 
correlation. Also the Q-Q plots, especially for the first trading phase, indicate heavy 
tails and a non-Gaussian behavior. A formal analysis with an application of Jarque- 
Bera test rejects the hypothesis that the data set is generated from normally distributed 
random variables. In order to improve the model fit we extend the model by intro- 
duction of a dynamic market price of risk in Sect. 3. 






i = 1, ... 



(7) 




Table 1 (continued) 
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Sample Autocorrelation Function Sample Autocorrelation Function 
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Fig. 2 Statistical analysis of EUA 07, time period 05-06 and 06-07 
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Fig. 3 Statistical analysis of EUA 12, time period 05-06 and 06-07 



Sample Autocorrelation Function 
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Fig. 5 Statistical analysis of EUA 12, time period 09-10 and 10-11 
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Fig. 6 Statistical analysis of 
EUA 12, time period 11-12 




3 Bivariate Pricing Model for EUA 

The evidence in the previous section shows that the market price of risk is actually 
time varying rather than constant. In order to illustrate the dynamic property of the 
market price of risk we consider a bivariate permit pricing model in this section. 



3.1 Model Description 



We model the market price of risk as an Omstein-Uhlenbeck process as its value 
can be either positive or negative and denote it by Recall the equation for the 
normalized price process under the risk-neutral measure Q given by (2). According 
to Girsanov’s theorem, the bivariate pricing model under the objective measure P is 
given by 



dat = d>' {at))^t{hdt dW}), 
dkf = 0(k — Xt)dt + (Jxdwf, 
dW}dW^ = pdt, 

where W/ and are two one-dimensional Brownian motions with correlation coef- 
ficient p. Note that under the model assumptions, the filtration (^t) in the probability 
space must be assumed to be generated by the bivariate Brownian motion. 

The use of Girsanov’s theorem in the bivariate model requires the condition that 
the process Zt given by 




is a martingale. A sufficient condition for (8) is Novikov’s condition: 



E 



[exp(^ 



( 8 ) 



< 00 . 



( 9 ) 
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Under our model assumptions, this condition is always satisfied.^ 

To calibrate the model we use the transformed price process to avoid complex 
numerical calculations in the calibration procedure. The bivariate model can be refor- 
mulated as 



d^t = + V^^j) dt + ^,dW}, 


(10) 


dX{ = 9(k — Xt)dt + (JxdWf, 


(11) 


dWjdW]- = pdt. 


(12) 



In (11), X represents the long-term mean value. 0 denotes the rate with which the 
shocks dissipate and the variable reverts towards the mean, ax is the volatility of the 
market price of risk. According to Carmona and Hinz [2], the price transformation 
is conditional Gaussian and its SDE can be solved explicitly. 



3.2 Calibration to Historical Data 



We consider the discretization of the model (10)-(12). By assuming the constant 
volatility terms in the time interval [tk-i, tk], the model equations can be discretized 
under Euler’s scheme given by 

^tk = \/ ^fk-i ’ ( 13 ) 

Xt^ = (1 — H- 6XAt H- (14) 

Cov{Sl^l) = p^ (15) 

where At = tx — (tk-i), namely the time interval, and ^ aK(0, 1). Zt/^ can 

be modeled by using the function P(T — The model parameter-set is therefore 
xf/ = [0, X, ax, p, a, fi]. 

As Xtj^ is a hidden state variable related to the price transformation, and only values 
of ^tk ^1 1™^ points t\, t 2 , . . . , tn can be determined from the market observations, the 
market price of risk series can be estimated by applying the Kalman filter algorithm. 
We have chosen to use the transformation process instead of the normalized price atj ^ . 
Because of the linear form of Eqs. (13) and (14) the standard Kalman filter algorithm 
is considered to be an efficient method for the model calibration. A detailed procedure 
to apply the standard Kalman filter can be found in [7]. To apply the Kalman filter 
model Eqs. (13)-(15) must be put into the state space representation to fit the model 
framework. The measurement equation links the unobservable state to observations. 
It can be derived from (13) and (14). After some manipulations, the equations of the 



^ A proof can be found in Appendix 1. 
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state space form for the model can be rewritten^ as 



St,, = 'Jf^{T-tk) + 



i + ^(P(T 






) If* + 



to-“At4i, 

(16) 



and 



A. 



tk 



+(1 - e/\t - crxpAt)>.fj_| + 0x4(1 - p^)At(^^^, (17) 



where and are independent, standard normally distributed random variables. 



For the estimation of the parameter vector xj/ = [0,X, ax, p, a, fi] consider 
the variable In each iteration of the filtering procedure, the conditional mean 
• • • , ^tk-i] and the conditional variance Var(^tk\^ti. • • • , can be cal- 
culated. We denote mean and variance by and respectively. The joint 

probability density function of the observations is denoted hyf(^ti-n I and is given 
by 



/(if,:j^) = n 

k=l 



1 / (If*-/..(v-))^ y 

^2jTi:t,iir) 2Zt,{ir) ) 



where ^t\-n summarize the observations from to ^tn- Its corresponding log- 
likelihood function is given by 






n 

2 



log 27T 



^-^log Zt.iir) 
^ k=l 




(If* - 



(18) 



The estimation results, their standard errors, t- tests and p- values can be found in 
Table 2. Figure 7 shows the estimation results of the market price of risk, compared 
with the price transformation and the historical futures price. In Fig. 8, a negative 
correlation between the price transformation and the market price of risk can be seen. 
The market price of risk is the return in excess of the risk-free rate that the market 
wants as compensation for taking the risk.^ It is a measure of the extra required rate 
of return, or say, a risk premium, that investors need for taking the risk. The more 
risky an investment is, the higher the additional expected rate of return should be. 
So in order to achieve a higher required rate of return, the asset must be discounted 
and thus will be sold at a lower price. Figure 8 reveals this inverse relationship. 

Moreover, we use the mean pricing errors (MPE) and the root mean squared errors 
(RMSE) given by 



^Eor a derivation of the state equation see Appendix 2. 

^Eor an economical explanation see [11], Chap. 27. or [17], Chap. 30. 
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Table 2 Test of model parameters at significance level 5 %, sample size 1536 



Parameter 


Coeff 


Std Err 


t-test 


p-value 


0 


1.5130 


0.3195 


5.7601 


0.0000 


k 


0.4091 


0.6117 


4.0641 


0.0001 


crx 


0.2913 


0.0193 


17.6365 


0.0000 


P 


0.0017 


0.0016 


9.0910 


0.0000 


a 


-1.5772 


0.0256 


61.5603 


0.0000 


p 


0.0172 


0.0005 


35.6312 


0.0000 




Fig. 7 MPR, futures price and price transformation from Jan. 2007 to Dec. 2012 




Fig. 8 Negative correlation of MPR and price transformation 



1 ^ 

MPE = -Y,(kr - ^U,r), 

ti=l 
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ti=l 
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Table 3 Performance of MPE and RMSE with 2000 observations 



Maturity 


MPE 


RMSE 


1 month 


-0.0153 


0.0182 


3 months 


-0.0208 


0.0234 


6 months 


-0.0366 


0.0397 


9 months 


-0.1273 


0.1302 



Sample Autocorrelation Function 
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Fig. 9 Statistical tests for the residuals in trading phase 2 



respectively, to assess the quality of model fit. Here N denotes the number of obser- 
vations, ^ti,T is the estimated price to maturity r , and is the observed price. Their 
values can be seen in Table 3. The absolute values of MPE and RMSE increase with 
time but still remain very low even 9 months before the maturity. Therefore, the 
conclusion is that the model is able to reproduce the price dynamics. 

In Eig. 9 we show the standard statistical test results of the residuals by taking into 
account the dynamic market price of risk. Comparing with the results from Eigs. 2, 3, 
4, 5 and 6, the time series of the residuals is relative stable with smaller variance. The 
sample auto-correlations and sample partial auto-correlations reveal very weak linear 
dependence of the variables at different time points. Also, the Q-Q plot indicates a 
better fit of a Gaussian distribution. 



4 Option Pricing and Market Forward Looking Information 

A general pricing formula of a European call is given by 
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where {r 5 } 5 e[ 0 ,r] stands for a deterministic rate, At denotes the futures price, ^ > 0 
is the strike price, and r G [0, Y] is the maturity. The normalized price process at is 
given by at = At jP, where P denotes the penalty for each ton of exceeding emissions, 
and therefore we have At — P<P (§?). A call option price formula written on EUA has 
been derived by Carmona and Hinz [2] under the assumption of a constant market 
price of risk. Under the assumption of a dynamic market price of risk, the option 
price formula is coherent with the formula in [2] given by 



where cp stands for the density function of a standard normal distribution. Here jJit,T 
and are the parameters of the distribution of which is conditional Gaussian. 
Under the risk neutral measure Q, iJLt,T and are given by 



In the following example, the penalty level is P = 100, the initial time t = 0 
starts in April 2005. EUA futures has maturity T on the last trading day in 2012. The 
European calls written on EUA futures with a strike at ^ = 15 and maturity T will 
be considered under a constant interest rate at r = 0.05. Eigure 10 shows the call 
option prices and the futures prices. The red curve stands for the option prices under 
dynamic MPR while the green curve stands for the option prices under constant 
MPR. 

To measure the impact of the dynamic market price of risk on the EUA option for 
different strikes we calculate the option price in the univariate and bivariate model 
setting respectively. Durations from 1, 3, 6 and 12 months to maturity are chosen for 
calls written on EUA 2012. The results are plotted in Eig. 1 1 . The red curve stands for 
the option prices evaluated by the bivariate model and the blue curve by the univariate 
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Fig. 10 Futures price and call option prices with ^ = 15 from 2005 to 2012 
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Call option on 11.17.2012 



Call option on 09.17.2012 





K (strike price) 

Call option on 06.17.2012 



K (strike price) 

Call option on 12.17.2011 





Fig. 1 1 Call option prices comparison for durations of 1 , 3 , 6, 1 2 months on EUA 2012 for different 
strikes 



model. The green line is the corresponding futures price at the given time. In most 
cases, one is interested in the option prices near the underlying price. According to 
the figure, the option prices in different model settings coincide except for a interval 
around the corresponding futures. In a short time before the maturity of EUA 2012, 
Fig. 1 1 shows a price overestimation by the constant MPR. This result is consistent 
with the result shown in Fig. 10, where we take ^ = 15 as a sample path. 

Moreover, one notes that the call price process with constant MPR develops below 
the call process with dynamic MPR in the first trading phase before 2008 and then 
increases slowly and moves to the upside of the call process with dynamic MPR dur- 
ing the second trading phase, before both processes vanish to the maturity because of 
lower underlying prices. The reason for the price underestimation before 2008 and 
overestimation thereafter can be explained as the assumption of a constant MPR in 
the whole trading periods and thus causes a neglect on the information of the market 
participants. Due to the regulatory framework of the carbon market, certificates carry 
information on the market participant expectations on the development of the fun- 
damental price drivers. Since the implied risk premia increase with time and exceed 
their ’average’ level in 2008, asset price must be discounted to compensate the higher 
risk. By using appropriate valuation models, this risk premia and the forward-looking 
information carried by prices of futures and options of certificates can be extracted. 



5 Conclusion 

We extract forward-looking information in the EU ETS by applying an extended 
pricing model of EUA futures and analyzing its impact on option prices. We find 
that the implied risk premium is time-varying and has to be modeled by a stochastic 
process. Using the information given by the risk premium we show that the option 
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prices during the first and second trading phases are underestimated and overesti- 
mated, respectively. The reason for the pricing deviation is caused by the assumption 
of a constant market price of risk which rigidifies the market participant expectations 
on the development of price drivers. The over- and underestimated prices are mostly 
concentrated in the interval including the corresponding futures, which is the area 
where the price most likely will evolve in the future. Although there is not a closed 
form for the option pricing formula, a simple numerical approach can be used to 
determine the price. 

Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 



Appendix 1 

In order to show the condition in (8), it is sufficient to prove the Novikov’s condition 
given by 



E[exp(-^ < 00- 

In the bivariate EUA pricing model, where kt follows a Ornstein-Uhlenbeck-Process 
given by 



this condition is always satisfied. 

Proof We first show that there exists a constant £ > 0 such that for any S G [0, T], 
we have 



To show (19) we consider the term in the expectation notation. By applying Jensen’s 
inequality we have 



kt = 0{k — kt)dt -h (JxdWt, 




(19) 




By applying Fubini’s theorem (19) becomes 






( 20 ) 
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The process Xt is a Gaussian process with mean and variance given by 



E[Xt] = !Xt= Xoe~^‘ + X(l - 



„-9t\ 



Var^f) 



1^(1 

29 



We have X( ~ V)- Z be a standard normal-distributed random 

variable Z ~ ./K(0, 1). So in (20) we have 

E|^exp (|a.?)] = E[exp + a,zfy^ 

= E|^exp (^ -I- ejXtOtZ + 

f 8 g}x^\ 1 / r 






sa, 



t Jl 



X + SfltCft^ 



^dx. 



To calculate the integration term above, let at = I — saf and bf = and make 

the integral- substitution. Then we have 

f I / 1 “ 2 \ T 

/ exp I V -\- £iitCft^]dx 

7M V27r \ 2 / 

= [ — ^ exp ( - l;(atx^ - 2btx))dx 
Jr v27r v 2 / 

€)■ 












According to the assumptions a, = 1 — is positive and the expectation is con- 
vergent for a small s and its value is 



E 






1 - ecr/ 



exp(— )exp(— 
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Thus the integral in (20) is finite and the exponential term in (19) is integrable. 

In order to show is a martingale we first consider that is a local martingale, 
hence it is a supermartingale. Therefore, Z^ is a martingale if and only if the condition 
E[Z^] = 1, Vr G [0, T], is satisfied. This martingale property can be shown by 
induction. Suppose E[Zo] = 1 which is trivial and E[Z^] = 1 for r g [0, S] for 
S < T. Let now r G [5, 5 + e] and set 




According to Novikov condition and (19), Z^ is a martingale. Then we have 



E[4] = E[ZsZ^] = E[E[Z5 Z^]|^s] = E[Z5E[Z^|^s]] = E[ZsZ|] = E[Zs], 



since 





= exp(O) = 1. 



It follows 

E[Zt]=E[Zs] = l fort e[S,S-\-s]. 

Then we have E[Z{] = 1 for r g [0, 5 + e]. Repeat this induction for times 
we have E[Zt] = 1 for r g [0, T], which implies Zt defined in (8) is a 
martingale. □ 



Appendix 2 

The bivariate EUA pricing model can be described as follows: 

Xty. = (1 — 0 + OXAt + 

where and are both random variables of the standard normal distribution. We 
want to put the model into the state space form. Price transformation depends on 
the current level of the market price of risk, which is an unobservable variable and 
therefore must be modeled in the equation of . We first let 



-P ^ > 



Pricing Options on EU ETS Certificates with a Time- Varying . . . 



359 



where and are both random variables of the standard normal distribution as 
well. This fact can be easily seen since we have 



Cov{i,lJtP = Cov\ 



(<■ 



= 0 . 



Note that 






^tk-i + — 0- 



Multiplying — <r>.p ^ at the both sides of the equation and sum it to the equation 

of A,(j, , it follows that 



Xtk = (1 - - (TxpAa,^_j + eXAt 

oxP 






(1 — OAt — axpAt)Xfj^_^ + 
+ ax 1 - p2^2 _ 



OXAt — 



This is the transition equation in the state space form, and the measurement equation 
would be 



^tk ~ ^tk+\ ~ + 




^tk + s/ztk^t'^l- 
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